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Abstract 

Content delivery networks often employ caching to reduce transmission rates from the central server to the end users. Recently, 
the technique of coded caching was introduced whereby coding in the caches and coded transmission signals from the central server 
are considered. Prior results in this area demonstrate that carefully designing the placement of content in the caches and designing 
appropriate coded delivery signals from the server allow for a system where the delivery rates can be significantly smaller than 
conventional schemes. However, matching upper and lower bounds on the transmission rate have not yet been obtained. In this 
work, we derive tighter lower bounds on the coded caching rate than were known previously. We demonstrate that this problem can 
equivalently be posed as a combinatorial problem of optimally labeling the leaves of a directed tree. Our proposed labeling algorithm 
allows for significantly improved lower bounds on the coded caching rate. Furthermore, we study certain structural properties of our 
algorithm that allow us to analytically quantify improvements on the rate lower bound for general values of the problem parameters. 
This allows us to obtain a multiplicative gap of at most four between the achievable rate and our lower bound. 
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I. Introduction 

Content distribution over the Internet is an important problem and is the core business of several enterprises such as Youtube, 
Netflix, Hulu etc. The operation of such large scale systems presents several challenges, including (but not limited to) storage 
of the data, ensuring reliable availability and efficient content delivery. One commonly used technique to facilitate delivery is 
content caching Q]|. The main idea in “conventional content caching” is to store relatively popular content in local memory either 
on the desired device or in a device at the edge of the network such as an intermediate router. This local memory is referred to 
as the cache. Upon request, this cached content is used to serve the clients, thus reducing the number of bits transmitted from 
the server and thereby reducing overall network congestion. Note that even web browsers, routinely cache the content of popular 
websites on a local machine to speed up the loading of webpages. 

Historically, content caching algorithms have attempted to optimize the placement of content in the caches so that the average 
number of bits that are transmitted from the central server to the end users is minimized 0-0- This often requires some 
knowledge on the popularity of file requests @-|8| made by the users. Moreover, the typical approach is to cache a certain 
fraction of the file and to obtain the remaining parts from the server when the need arises. Coding in the content of the cache 
and/or coding in the transmission from the server are typically not considered. 

The work of |9J introduced the problem of coded caching, where there is a server with N files and K users each with a cache 
of size M. The users are connected to the server by a shared link (see Fig. [TJ. In each time slot each user requests one of the 
N files. There are two distinct phases in coded caching. 

• Placement phase: In this phase, the content of caches is populated. This phase should not depend on the actual user 
requests (which are assumed to be arbitrary). Typically, this placement phase can be executed in the off-peak hours where 
the amount of network traffic is low. 

• Delivery phase: In this phase, each of the K users request one of the N files. The server transmits a signal of rate R over 
the shared link that simultaneously serves to satisfy the demands of each of the users. 

The work of |9|| demonstrates that a carefully designed placement scheme and a corresponding delivery scheme achieves a rate 
that is significantly lower than conventional caching. While coded caching promises very significant gains in transmission rates, 
at this point we do not have matching upper and lower bounds on the (R, M ) pairs for a given N and K. 

In this work our main contribution is in developing improved lower bounds on the required rate for the coded caching problem. 
We demonstrate that the computation of this lower bound can be posed as a combinatorial labeling problem on a directed tree. 
In particular, our method generates lower bounds on aR + rIM , where a, /3 are positive integers. We demonstrate that a careful 
analysis of the underlying combinatorial structure of the problem allows us to obtain significantly better lower bounds than 
those obtained in prior work 0-0- In addition, our machinery allows us to show that the achievable rate of |9) is within a 
multiplicative factor of four of our proposed lower bound. 


The material in this work has appeared in part at the 2015 IEEE International Symposium on Information Theory. 
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Fig. 1: Block diagram of coded caching system. 


This paper is organized as follows. Section [II] discusses the background, related work and summarizes the main contributions 
of our work. Section [111] presents our proposed lower bound technique. The multiplicative gap between the achievable rate and 
our lower bound is outlined in Section IV Our proposed strategy also applies to certain variants of the coded caching problem 
that have been discussed in the literature; this is explained in Section [V] There have been some other approaches presented in 
the literature 0-03 for improving the lower bound on the coded caching rate. We present comparisons between our approach 
and the other approaches in Section [VT] We conclude the paper with a discussion of opportunities for future work in Section 

i7m 


II. Background, Related Work and Summary of Contributions 

In a coded caching system there is a server that contains N files, denoted W t , i = 1 ,,N, each of size F bits. There are K 
users that are connected to the central server by means of a shared link. Each user has a local cache memory of size MF bits; 
we denote the cache content by the symbol Z* (which is a function of W \,..., IT'y). In each time slot, the i -th user demands 
the file W d . where d t £ {1,..., N}. The coded caching problem has two distinct phases. In the placement phase, the content of 
caches is populated; this phase should not depend on the actual user requests (which are assumed to be arbitrary). In the delivery 
phase, the server transmits a potentially coded signal that serves to satisfy the demands of each of the users. A pair (M, R) is 
said to be achievable if for every possible request pattern (there are N K of them), every user can recover its desired file with 
high probability for large enough F. We let R*{M) denote the infimum of all such achievable rates for a given M. 

The coded caching problem can be formally described as follows. Let [m] = {1,... , to}, where to is a positive integer. Let 
{W n }n =i denote N independent random variables (representing the files) each uniformly distributed over [2 F ]. The i-th user 
requests the file W di , where di £ [IV], A ( M,R ) system consists of the following. 

• K caching functions, Z t = cf)i(Wi,..., Wn) where 0 t : [2 F ] —» [2^ FM J]. 

• A total of N h encoding functions <p dl ,..., dK (W\,..., Wn), so that the delivery phase signal X'd 1} ...,d K — <Pd 1 ,...,d K (Wii ■ ■ ■ ; WW). 
Here, tp dl ,...,d K ■ [2 F ] W -A [2^-1], 

• Lor each delivery phase signal and each user, we define appropriate decoding functions. There are a total of KN k of them. 

Lor the A-th user Pd x ,...,d K -,k{X dl dK , Z k ), k = l,...,K so that decoded file W dl dK - k = Wi ,...,d K ;k( X di,...,d K , Zk)- 
Here pt dl ,..., dK;k : [2^J] x [2L™J] [2 F ], 

The probability of error is defined as 

max max P{W dlt ... tdK - k ± W dk ). 

(di ,..., dK )e[N} K kz\K] 

Definition 1: The pair (M, R) is said to be achievable if for e > 0, there exists a file size F large enough so that there exists 
a (M, R) caching scheme with probability of error at most e. We define 

R*(M) = inf {R : (. M,R ) is achievable}. 

In this setting, it is not too hard to see that the best that a conventional caching system can do is to simply store an M/N 
fraction of each file in each of the caches. In order to satisfy the demands of the user, the server has to transmit the remaining 
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(1 — M/N) fraction of each of the K files. Thus the transmission rate (normalized by F) is given by 


Ru(M) = min(iV, K) ^1 - . (1) 

Note that min (N,K) is the transmission rate in the absence of any caching. In [91, the factor (1 — M/N ) is referred to as the 
local caching gain as it is gain that is obtained purely from the cache, without any optimization of the transmission from the 
server. In the setting where we perform nontrivial coding in the cache and delivery phase encoding functions, |9j demonstrates 
that a carefully designed placement scheme and a corresponding delivery scheme achieves a rate 


R C (M ) = K 



f 1 

N \ 

n) 

mm [1 + KM/N’ 

K) 


( 2 ) 


where M £ {0, N/K, 2N/K, ..., iV}. Other values of M are obtained by time-sharing between different solutions. 

The factor 1+I ^ M / N which definitely dominates when N > K is referred to as the global caching gain. It is to be noted 
that global caching gain depends on the overall cache size across all the users (owing to the term KM/N in the denominator) 
whereas the local caching gain only depends on the per-user cache size (owing to the term 1 — M/N). The work of |9) also shows 
that the rate Rc(M) is within a factor of 12 of the information theoretic optimum for all values of (V, K and M. Furthermore, 
they compare their achievable rate (cf eq. <[2}i) to a cutset bound that can be expressed as follows. 


R*(M ) > max 

{1,min( N, K )} 



(3) 


A. Related work 

Coded caching is related to but different from the index coding problem [T2)|. In the index coding problem, there are N' sources 
such that i-th source has message W t , i = 1,..., N' . There are K terminals, each of which has some subset of {Wi ,..., Wn 1 } 
available. In addition, each terminal requests a certain subset of the messages {Wi,..., Wn 1 }- The aim in the index coding 
problem is to minimize the number of bits that are transmitted on the shared link so that the demands of each user are satisfied. 
It is well recognized that the index coding problem for arbitrary side information is a computationally hard problem where 
nonlinear codes may be necessary ]T2) , JT?) . In particular, the optimal linear index code corresponds to minimizing the rank 
of an appropriately defined matrix over a finite field. This so called minrank problem |12| is also known to be computationally 
hard. It can be observed that for a fixed but uncoded cache content and a fixed set of demands of the various users, the problem 
of determining the optimal delivery phase signal in the coded caching problem is equivalent to an index coding problem. Note 
however, that in the coded caching problem, we allow the cache content to be coded. 

Since the original work of |9j, there have been several aspects of coded caching that have been investigated. Reference 14) 
considers the scenario of decentralized caching when the placement phase is driven by the users who randomly populate their 
caches with subsets of the files stored at the server. Approaches for updating the cache content are considered in and the 
case of files with different popularity scores are considered in fl6) and |[T7i), Jl8) . Security issues in this domain are considered 
in G3- The work of [20] | considers the more general case of hierarchical coded caching, where certain intermediate nodes in the 
network are equipped with potentially larger caches and investigates methods for minimizing the overall traffic in such networks 
(see also &■ Coded caching where each user requests multiple files was investigated in [22| . The case of device-to-device 
(D2D) wireless networks where there is no central server was examined in [23], [24] , Systems with files of differing sizes were 
examined in [25]. 

In addition to these contributions, there have been other lines of work that deal with content caching. In a parallel line of 
work | |23) , (26)— [28] consider the problem of femtocaching in a wireless setting where in addition to a central server (or base 
station), there are helpers (with caches) interspersed in a cell that help the end users satisfy their demands. The goal is again 
to consider caching strategies that minimize the overall rate, but the solution approaches do not consider the worst case rate 
over all possible demand patterns; instead the popularity scores of the different files are explicitly taken into account. Moreover, 
while coding is considered, it is conceptually different in the sense that the coding is only restricted to parts of the same file 
and coding across different files is not considered. 

There has also been parallel work on establishing lower bounds for the coded caching problem. In 1101, the Han’s inequality 
was leveraged to obtain an improved lower bound. A multiplicative gap of 8 between their lower bound and the achievable rate 
in eq. 0 ws established. The work of ED also presents a lower bound technique. As discussed in Section [VT] their technique 
can be considered as a special case of our work. The specific case of N = K = 3 was considered in 1291 via a computational 
approach. We compare our technique with these other approaches in Section [VI] 
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Fig. 2: An example of a coded caching system with N = 9 files, K = 3 users. Note that the proposed lower bound is better than the cutset 
bound and matches the achievable rate points at multiples of N/K. 


B. Summary of our contributions 

In this work our main contribution is in developing improved lower bounds on the rate for the coded caching problem. We 
show that the cutset based bound in eq. ([3} is significantly loose and propose a larger class of lower bounds that are significantly 
tighter. Our specific contributions include the following. 

• We demonstrate that the computation of our lower bound can be posed as a combinatorial labeling problem on a directed 
tree. Our method generates lower bounds on aR* + j3M, where a, (3 are positive integers. While the cutset bound only 
optimizes over at most min (TV, K) choices, our technique allows us to consider many more (a, ji) pairsQ 

• We demonstrate that a careful analysis of the underlying combinatorial structure of the problem allows us to obtain 
significantly better lower bounds than those obtained in prior work. For a given pair {a, f3) and number of users K, it is 
intuitively clear that the lower bound on aR* + (3M will be large if the number of files N is large. We define the notion 
of a saturated instance, which are directed trees and corresponding labelings that give the largest possible lower bound 
(using our technique) using as few files as possible. An analysis of saturated instances allows us to always improve on the 
cutset bound and in most ranges of M, our bound is strictly better. 

• Our machinery allows us to show that the achievable rate of (9) is within a multiplicative factor of four of our proposed 
lower bound for all values of N and I\ . This is possible by analyzing some combinatorial properties of saturated instances. 
Note that the multiplicative gap of four is currently the best known for this problem. 

• Our proposed technique also applies to other variants of coded caching problem. We discuss the application of our work 
to the case of D2D wireless networks and coded caching with multiple requests as well. 

As an example. Fig. [2] illustrates the tightness of the proposed lower bound for a coded caching system with a server that 
contains N = 9 files and K = 3 users. Specifically, our proposed bound demonstrates the optimality of the achievable scheme 
for values of M that are integer multiples of N/K in this specific case. 

III. Lower Bound on R*(M) 

In this section we present our proposed lower bound on R*(AI). We begin with an example that demonstrates the core idea 
of our approach. 

Example 1: Consider a coded caching system with N = K = 3. Then, the following sequence of information theoretic 


1 The cutset bound can be considered as a special case of our bound 
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inequalities hold. 

2 R*F + IMF > H{Z 1 ,X 123 ) + H(Z 2 , X 312 ) 

= /(W i; X 123 ) + H(Z 1; X 123 | Wi) + J(W i; Z 2 , X 312 ) + H(Z 2 , X 312 \Wi) 

= H(W 0 - ^WklZ,, X 123 ) +ff(Z 1 ,Xi 23 |W 1 ) + -ff(Wi|Z 2 , X 312 ) + F(Z 2 ,X 312 |W 1 ) 

( 6 ) 

> F(1 - e) + F(1 - e) + H(Z lt Z 2 , X\ 23 , X 332 | Wi) 

= 2F(l-e) + I(W 2 ,W 3 -,Z u Z 2 , X 123 , X 312 1 W 1 ) + H{Z U Z 2 , X 123 , X 312 1 W U W 2 ,W 3 ) 

(c) 

> 2F(1 - e) + 2F(1 - e) = 4F(1 - e), 

where equality (a) holds by the definition of mutual information. Inequality (b) holds by Fano’s inequality since the file W\ can 
be recovered with e-error from the pairs (Z i,Xl 23 ) and (Z 2 ,X 332 ) and by the fact that conditioning reduces entropy. Similarly, 
inequality (c) holds by Fano’s inequality since the files W 2 and W 3 can be recovered with e-error from (2j, Z>, X \ 23 , X 3 \ 2 ). 
This holds for arbitrary e > 0 and F large enough. Dividing throughout by F we have the required result. 

Thus, the key idea of the above bound is to choose the delivery phase signals in such a manner so that the various terms that 
are combined allow the “reuse” of the same file multiple times. For instance, in step (a) of the above bound, we use the definition 
of mutual information to rewrite the terms H(Z\, Xl 23 ) and F[{Z 2 , X 3 i 2 ). Note that both pairs (Z i, X 323 ) and (Z 2 ,X 3 i 2 ) allow 
the recovery of the same file W\, resulting in a contribution of 2 F to the lower bound. On the other hand, the files W 2 and W 3 
are recovered only once. The overall result is a lower bound of 4 F. 

Thus, our lower bound works with judiciously chosen labels for the delivery phase signals and combines them with the 
cache signals in an appropriate way such that a given file is recovered a large number of times. It turns out that doing this 
systematically and tractably requires the development of several new ideas. For instance, the aforementioned chain of inequalities 
can be equivalently represented in terms of a directed tree with appropriate labels on its leaves and edges as shown in Fig. [3] 
In particular, the leaves of the tree are labeled with cache signals Z\ and Z 2 and delivery phase signals X 323 and X 3 i 2 . Each 
internal node of the tree corresponds to the operation of combining the signals and its outgoing edge is labeled by the newly 
recovered file(s), e.g., at node u±, the file W\ is recovered. Likewise at node u*, the files W 2 and W 3 are recovered. The 
lower bound can be obtained by summing the cardinalities of the edge labels. Towards the goal of generating these bounds in a 
systematic manner, we introduce the following definitions. 

Definition 2: Directed in-tree. A directed graph T = ( V. A), is called a directed in-tree if there is one designated node called 
the root such that from any other vertex v £ V there is exactly one directed path from v to the root. 

The nodes in a directed in-tree that do not have any incoming edges are referred to as the leaves. The remaining nodes, excluding 
the leaves and the root are called internal nodes. Each node in a directed in-tree has at most one outgoing edge. We have the 
following definitions for a node v £ V. 

out(v) = {u£V : (v,u) £ A}, (outgoing neighbor) and, 
in(v) = {u £ V : (it, v) £ A} (incoming neighbor set). 
in — edge{y) = {e £ A : e = (u, i>)} (incoming edge set). 

In this work, we exclusively work with trees which are such that the in-degree of the root equals 1. There is a natural topological 
order in T whereby for nodes u £ T and v £ T, we say that u v if there exists a sequence of edges that can be traversed to 
reach v from u. This sequence of edges is denoted path(u,v). 

Definition 3: Meeting point of nodes in a directed tree. Consider nodes v-\ and v 2 in a directed in-tree T = ( V, A). We say 
that v\ and v 2 meet at node u if there exist path(vi,u) and path(v 2 ,u) in T such that path(vi,u) (T path{v 2 ,u) = 0. As there 
exists a path from any node in T to the root node, it follows that the existence of node u is guaranteed. 

Let D = U die [jv],...,d x e[iV]{^di,...,d A -}- 

Definition 4: Labeling of directed in-tree. Each node v £ T is assigned a label, denoted label(v), which is a subset of 
{FFi,..., W / at}U{2'i, ..., Zk}UD. Moreover, we also specify W(v) C { W\ ,..., Wn }, v ) C {Z\, ..., Zk} and 1D(?;) C D 
so that labeliy) = W(v) U Z(v) U 1D(i;). 

In our formulation, the leaf nodes are denoted Vi, i = 1,... ,£ are such that W(vf) = 0. 

Definition 5: We say that a singleton source subset {Wi} is recoverable from the pair (Zj,Xd 1 ,...,d K ) if dj = i. Similarly, for 
a given set of caches Z' C {Z i,..., Zk} and delivery phase signals D' C D, we define a set Rec(Z\ D') C {Wi ...., Wn} 
to be the subset of the sources that can be recovered from pairs of the form ( Zi,Xj ) where Z, £ Z' and J is a multiset of 
cardinality K with entries from [N] such that Xj £ D'. 

We let the entropy of a set of random variables equal the joint entropy of all the random variables in the set. We also let 

[a;] + = max(a;, 0). 
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Algorithm 1 Lower Bound Algorithm 

Input: T = (V, A) with leaves and {label(vi)}j =1 , such that W(vi) = 0, i = 1,..., i. 

Initialization: 

l: for it— L ... £ do 

2: W new (Vi) = A(Vi,Vi). 

3: %(vi,out(vi)) — ^^new • 

y(vi,out(vi )) I A^ new (Vi ) |. 

5: end for 

6 : while there exists an unlabeled edge do 

7: Pick an unlabeled node u € V such that all edges in in — edge(u) are labeled. 

8: W(zi) = U vein ( u) W(v) U W new {v). 

9: Z(u) = U vein(u) Z(v). 

10: D(zt) = LL^ n ( u )lD(z:). 

11: W new (u) = A(u,u) \W(u). 

12 : X( u ,out(u)) W^newi.^)- 

13: y(u,out(u)) I A^ new (u) |. 

14: end while 

Output: L = Y /eGA y e . 


{zy {x l23 } {z,} {x 3l2 } 



Fig. 3: Problem instance for Example [7] For clarity of presentation, only the W„ ew (u) label has been shown on the edges. 


Given a directed tree T with appropriate labels on its leaves we present an algorithm that generates an inequality of the form 
aR* + /3M > L(a,/3). For nodes u,v 6 T, we define the following. 

A(u,v) = Rec(Z(u),D(v)), and 

W new (u) = A(u,u)\W(u). (4) 

Algorithm fl] operates as follows. It takes as input a directed in-tree T where each leaf w,, i = 1 ,... ,t has labels r £(v,) and 
ro(t'i) (Wfz;, jis set to 0). The algorithm determines the files that are recovered at each v, and labels the corresponding outgoing 
edge with W new {vi) and \W new (vi)\. Following this, the algorithm propagates the labels further down the tree in the following 
manner. For a given node u whose incoming edges are labeled, we set Z(u) = U ve i n ( u )Z(v) and B(m) = U^ g j n („)D(t;), i.e., 
each of these labels is set to the union of the corresponding labels of the nodes that belong to the incoming node set of u. 
Next, it sets W(it) = U„ eirl ( J 1 )W(i;) UW new (v), i.e., in addition to the W-labels of the incoming node set, W (u) also contains 
the new files that are recovered on the incident edges. Note that at each internal node certain cache signals and delivery phase 
signals meet, e.g., Z\ and X\ 23 meet at node u\ in Fig. [3] The outgoing edge of an internal node is labeled by the new files that 
are recovered at the node, e.g., at ui the signals Z\ and A '123 recover the file W\. We call a file new if it has not been recovered 
upstream of a given node. In a similar manner at u* one can recover all the files W[ ...., W 3 ; however only the set { W 2 ■ W 3 } 
is labeled on edge ( u*,v*) as W\ was recovered upstream. This process is continued recursively, i.e., we label the outgoing 
edges with the new files that are recovered at node u, propagate the labels and continue thereafter. The algorithm continues until 
it labels the last outgoing edge. 

It can be seen that the operation of Algorithm [T] is in one to one correspondence with the new files recovered in the sequence of 
inequalities in the lower bound. For example, the outgoing labels of u± and u 2 in Fig. ^correspond to step (a) in the inequalities 
in Example [I] We formalize this statement in the Appendix (Lemma [3j where we show that a valid lower bound is always 
obtained when applying Algorithm [I] 

Definition 6 : Problem Instance. Consider a given tree T with leaves v. L , 1 = 1,.... I that are labeled as discussed above. Let 
a = Eli |E>(w)| and = Eli \ r ^(v t )\. Suppose that the lower bound computed by Algorithm [l] equals L. We define the 
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Fig. 4: For a given node u £ T, its in-neighbors are denoted iq and u r . The corresponding subtrees are denoted 77, f n and T u ( r ) 
and are shown enclosed in the dotted boxes. 


associated problem instance as P(T, a , (3, L , N , K). We also define a = | uf =1 B(i/j)| and f3 = | u| =1 Z(iq)|. A problem instance 

P(T, a, /3, L, A”, K) is said to be optimal if all instances of the form P'(T' , a , (3, Lf , At, A') are such that L' < L. 

It is worth emphasizing that a < a and (3 < /3 as some cache and delivery phase signals may be repeated. 

In the subsequent discussion, we focus on understanding the characteristics of optimal problem instances. Towards this end, 
we shall often start with a problem instance P and modify it in appropriate ways to arrive at another instance P 1 . For ease of 
presentation, when needed we shall refer to quantities in instance P(P') by using the corresponding superscripts. For example 
for a node u in P (P'), we will denote the set of new files by W^ ew {u) {W^ ew {u)). 

It is not too hard to see that it suffices to consider directed trees whose internal nodes have an in-degree at least two. In 

particular, if u has in-degree equal to 1, it is evident that W new (u) = 0 and thus, \W new {u)\ = 0. In addition, we claim that 
w.l.o.g. it suffices to consider trees where internal nodes have in-degree at most two. Therefore, we will assume that all internal 
nodes have degree equal to two. More specifically, we can show the following property of problem instances (the proof appears 
in the Appendix). 

Claim 1: Consider a problem instance P(T, a, 13. L, N , K) such that there exists a node u £ T with \in(u)\ > 3. Then, there 
exists another instance P'(T', a, (3, L', N, K) where L' > L and in (it) < 2 for all nodes u £ T' ■ 

Henceforth, we assume that all internal nodes in the problem instances under consideration have in-degree equal to two. Claim [I] 
can also be used to conclude that each leaf v in an instance P is such that either |Z(i/)| = 1 or |B(i/)| = 1 but not both. Indeed, 
if there exists a leaf v that violates this condition, we can use the modification in the proof of Claim [Tito replace v by a directed 
in-tree so that the condition is satisfied. If |Z(n)| = 1, we call v a cache node; if |ID(r>)| = 1 we call it a delivery phase node. In 
the subsequent discussion we will assume that the delivery phase nodes are labeled in an arbitrary order V \,..., v a and the cache 
nodes from u a+ i,..., v a+ p, where we note that a + f3 = £. Moreover, we let T> = {iq,..., v a } and C = {tta+i,..., v a+ p}. 

In the tree T corresponding to problem instance P(T, a, /3. L, N, K), consider an internal node u and the edge e = (u, v). In 
the subsequent discussion, we shall use T u to refer to the subtree that has its last edge as (it, out(u)), i.e., the subtree that is rooted 
at out(u). The incoming edges into it, denoted (it/, u) and (■ u r , it) are the last edges of the disjoint left and right subtrees denoted 
T u n) and T u (r) respectively (see Fig. [4b. Each of these subtrees defines a problem instance Pi = P{T u (i)i a i, f3i, Li, N , K) and 
P r = P(T u (r),&r, (3 r , L r , N, K). We denote the set of delivery phase nodes and cache nodes in T u ( r ) by 

Pu(r) — {it £ P . v £ 7T(r)} and 

Cu(r) = {tt £ C : V £ 77(r)}? 

with similar definitions for V u ^ and C u ^y We also let 

T3 U -Dud) C Pu(r)i and 
Cu Cu(l) ^3C u [r)- 

Let T/ = U vG -j- uW W new (v) and F r = U ve j- u{r) W new (v), i.e., T/ and T r are the subsets of {W \,..., that are used up in 
the problem instances Pi and P r respectively. It can be observed that T/ = A(iq,it/) and F r = A (u r ,u r ). 

We shall often need to reason about the files recovered at the node u from the different subtrees. For instance, the set of cache 
nodes in T u ( r ) and the delivery phase signals in T u (i) meet and recover a subset of the files at u. This set of files corresponds to 
those recovered from Z (u r ) \ Z( ui ) and B(it/), and can be informally thought of as the files recovered when going from right 
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Algorithm 2 Computing 

Input: P(T 7 a,l3,L,N 7 K), Array fl{u,5 u ), where u GT, 5 U C W new (u), |6 U | = 1. 

l: Initialization 

2: for all u € T, S u C W new {u ) where |<5„| = 1 do 

3: ^— O' 

4: end for 

3: end Initialization 

6: for i <- 1 to a do 

7: for all z/ G C do 

8 : Let u be the meeting point of Vi and v'. 

9: S u = A(v',Vi). 

10 : if S u G W new (u) and f l(u,S u ) == 0 then 

11 : ip(vi, v ') G- 1, and fl(u,S u ) •<— 1. 

12 : else 

13: 0. 

14: end if 

15: end for 

16: end for 


to left. Accordingly, we have the following definitions. 

A r i(u) = Rec(h(u r ) \ Z(zq), ©(zq)), and 
Ai r (u) = Rec(1j(ui) \ Z(zz r ), ©(zz r )). 

Note that by definition, we have 

A(u, u) = Rec(h(u), ©(zt)) 

= i?ec(Z(zq) U Z(zt r ), ©(zq) U D(zq-)) 

= i?ec(Z(zq), ©(zq)) U i?ec(Z(zz r ), ©(zq.)) U i?ec(Z(zq), ©(zq.)) U f?ec(Z(zq), ©(zq)) 

== f?ec(Z(zq), ©(zq)) U i?ec(Z(iz r ), ©(zq.)) U i?ec(Z(zq) \ Z(zq-), ©(zq.)) U i?ec(Z(zq.) \ Z (ui), ©(zq)) 

= A(Z(zq), ©(zq)) U A(Z(iq.), ©(zq.)) UA/ r (zz) U A ri (zz), and 

from Tu(i) from 7^,( r ) 

W(zz) = A(Z(zq), ©(zq)) U A(Z(zq), ©(zq.)), 

where (a) follows since the f?ec(Z(zq), ©(zq)) potentially contains some files that have already been recovered in i?ec(Z(zq), ©(zq)). 
The other equality holds because of similar reasoning. Therefore, it follows that 

W new (u) = A (u,u) \ W (u) 

= A r ,(u)UA, r (tt)\W(u). (5) 

Note that based on Algorithm |T] we can conclude that 

W(it) = U„ e{UriUi} W(z;) U W new (v) 

= U v yuW new (v) (by arguing inductively). ( 6 ) 

For the subsequent discussion, it will be useful to express the value of the lower bound L for an instance P(T. a, /3, L, N, K) 
in a functional form. In particular, we define the function z/> : V x C —> {0, 1} that allows us to express L in another way. For 
nodes zq € V,v' G C we can define their meeting point it G T. The function 'dfv,. v') is determined by means of Algorithm |2j 
where the sequence in which we pick the nodes v\ ,..., v a is fixed. Each element of W new (u) can be recovered from multiple 
pairs of nodes that meet there. The array Cl(u,S u ) keeps track of the first time the file S u is encountered. The function q(zq, v') 
takes the value 1 if the file W* recovered from the pair (Z(i/),©(i if)) at u belongs to W new (u) and has not been encountered 
before and 0 otherwise. A formal description is given in Algorithm [2] 

Claim 2: For an instance P(T, a, )3, L, N, K) the following equality holds 

i =1 v'£C 


( 7 ) 
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Proof: We first note that at the end of Algorithm |2j we have Cl(u,S u ) = 1 for all u £ T and all i5 u C W new (u) such that 
|5 U | = 1. To see this suppose that there is a u\ £ T and a singleton subset S Ul of W new (ui) such that 17(it 1 , S Ul ) = 0. Now S Ul 
is recovered from some delivery phase node and cache node, otherwise it would not be a subset of W new (ui). As our algorithm 
considers all pairs of delivery phase nodes and cache nodes, at the end of the algorithm it has to be the case that 17(iti, S Ul ) = 1. 

Next, we note that for each pair (ui,6 Ul ) where iti £ T and 5 Ul is singleton subset of W new (ui), we can identify a unique 
pair of nodes (vi,v') where v 4 £ V and v' £ C such that and £l(ui,6 Ul ) are set to 1 at the same step of the algorithm. 

The remaining pairs (vi, v') that cannot be put in one to one correspondence with a pair (iti, S Ul ) are such that ip(vi, v') are set 
to 0. Moreover as E„ g t Es u cw nem (u),\8 u \=i n ( u > <U = T, u er\ W new{u)\ = L, it follows that L = Ef=i E„' G c ■ 

We now illustrate the definitions introduced above by means of the following example. 

Example 2: The problem instance in Fig. B] has seven internal nodes, {vi 1 ,..., it 6 , it*}. In the initialization step. Algorithm [ 2 ] 
sets 17(iti, {W\}) = 0 for 1 < i < 4, 17(u,, {W 2 }) = 17(iij, {W 3 }) = 0 for i = 5, 6 and 17(it*, {W 4 }) = 0. In the next step, for 
node vi it sets = 1, 17(ui, {VPi}) = 1 (for U 5 £ C) and ip(v i,vq) = 1, 17(w 5 , {W 2 }) = 1 (for vq £ C). For V 7 £ C we 

have S u * = A(v 7 ,vi) = {W 3 } and since W 3 ^ W new (u*) = {W 4 } therefore ip(V\,V 7 ) = 0. By the same argument we have 
ip(vi : vs) = 0. Thus, the contribution of v\ to the lower bound, namely E^'gc = 2- The complete description of the 

steps after the initialization, is shown in Table [I] The table should be read in column order from left to right. Within a column, 
the order of the operations is from top to bottom. Note that there are two cases, V 3 £ T>, ve £ C and 1)4 £ V, Vq £ C where •) 
value is set to 0 (since the corresponding 17(-, -) values are already 1). In both cases 5 U * = {W 4 } and since W 4 is recovered 
already, 17(it*, {W 4 }) has already been set to 1 when considering V 2 £ V, V 7 £ C. Therefore ip(V 4 ,Vq) = V 3 ,Vq) = 0. Another 
point to be noted is that delivery phase node i >2 contributes three files towards L while the other delivery nodes contribute only 
two files each. 


setting 

Vi 

v 2 

v 3 

V 4 

v 5 

S U1 = Wi 
ip(vi,i’s) = 1 

fi(ui, W£) = 1 

<5u 5 = W 3 

V 5 ) = 1 

0(m 5 , W 3 ) = 1 

5„. = W 2 
^(v 3 ,v 3 ) — 0 

5 U . = Wx 
‘ 4 >{vi,ve) = 0 

^6 

<5 » 5 = W 2 
= 1 

n(u 5 , W '2 ) — 1 

<5 „ 2 = Wi 
ve) — 1 
n(« 2 , Wx) = 1 

= W 4 

i/>(v 3 ,v 6 ) = 0 
Q(u* , Wi) = 1 

= W 4 

x/>(v 4 ,v 6 ) = 0 
fi (u*,W 4 ) = 1 

v 7 

5 U * = W 3 

ipiv 1 , V 7 ) = 0 

5 U * = Wi 

y>{v 2 , v 7 ) — 1 
Q(u*, Wi) = 1 

<l 3 = Wx 
lfl(v 3 ,v 7 ) = 1 
Q(u 3 , Wi) — 1 

Su 6 = w 2 

il)(v 4 , V 7 ) = 1 

^(Ue, W 2 ) — 1 

^8 

8 U * = Wx 
if>(vi,vs) = 0 

<5„, = W 3 
■ 4 >(v 2 , v s ) = 0 

<5 U6 = W 2 
i’{v 3 , Vs) = 1 
f2(M6> W 2 ) = 1 

<L 4 = W 3 
x/>(v 4 ,v s ) = 1 

n(ii 4 , w 3 ) — 1 


TABLE I: The steps in Algorithm [ 2 ] after initialization when applied to Example [ 2 ] The steps flow from the leftmost to the rightmost column, 
and in each column from the top to the bottom row. 


Corollary 1 : For an instance P( 7 ~, a, / 3 , L, N, K), we have L < a min(/ 3 , K). Moreover, if N > a min(/ 3 , K ), then there 
exists an instance such that L = amin(/ 3 , A'). 
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Proof: For a node Vi, where 1 < i < a, we have 

Y i’iViiV 1 ) < I U„/ eC Z(u')l 

v’ec 

= 3, 

< min(/3, K). ( 8 ) 

Let u denote the meeting point of v' and Vi. The first inequality above holds since tp(vi,v') = 1 implies that S u = A (v',Ui) C 
W new (u ) and ^ 

Y ^i v ^ v ') ^ I u vec Rec(T)(vi),Z(v'))\ = |i?ec(!D(v i ),U„- 6 cZ(t;'))| < | U v > eC Z(u')|. 

v'ec 

From eq. ([8 1 we can conclude that L = J2v’ec v ') — ct min (f3,K). If N > a min(/3, A'), it is easy to construct an 
instance wrth L = a min(/3, AT). We simply pick any directed tree on a + 3 leaves. Let the cache node indices be Z 1 repeated 
3 — min(/3, K) + 1 times and Z 2 , Z 3 ,..., Z m i n rp t K)-h Zmintp, k)- Suppose that node v £ T>, v' £ C' meet at node u. We label 
the delivery phase leaves such that | U ( v y)eVxC' A(t/,'i>)| = a min(/3, K ). This can be done since N is large enough so that 
we can choose the labels such that Rec(Z(v[), B(ui)) IT iJec(Z(t4), B(i> 2 )) = 0 for vfv '2 £ C' and v\,v^ £ T>. For instance, 
initialize D(u) = 1 for all v £ V and then set !D(n;) = ,...,d K , dj = (i — l)a + j for j = 1,..., min(/3. K), and 

i = 1 ,..., a. ■ 

We illustrate the construction outlined above by means of the following example. 

Example 3: Let a = /3 = 2, K = 2, and N = 4. We arbitrary pick a directed tree with v-[, V 2 as delivery nodes and V 3 , 1:4 as 
cache nodes. We label ^(^ 3 ) = Z- t and Z(tt 3 ) = Z 2 , and delivery nodes as ID(ui) = X 12 and = A 3 4 . Such a problem 

instance is illustrated in Fig. [ 6 ] (a). As we will see later, this instance is not efficient in reusing files. 

At this point we have established that for a given problem instance P(T, a, /?, L, N, K), we can always generate an inequality 
of the form all* + (3M > L. It is natural to therefore consider the optimal problem instances that maximize the lower bound 
for a given value of a, /?, N and K. 

Definition 7: For given a,/3,N and K , we say that a problem instance IfiT ”, a, 3, L~. N, K) is optimal if all problem 
instances P' (T, ct, /3, L, N, K)) are such that L* > L. 

Recall that $ = \ uf =1 Z(Uj)|. For a problem instance P(T, a, /3, L, N, K), it may be possible that j3 < min(5. K). However, 
given such an instance, we can convert it into another instance where $ = min(/3, K) without reducing the value of L. In fact 
the following stronger statement holds (see Appendix [B] for a proof). 

Claim 3: For a problem instance P(T, a, 3- L, N. K) suppose that there exists an internal node u* with associated problem 
instance P* = P(T U * ,a*, ft*, L*, N*, K) such that the following condition holds. 

3* < min(/3 *,K). 

Then, there exists another problem instance P'{T',a, 3, L', N, K) where li > L such that the above condition does not hold. 
The next claim formalizes the intuitive fact that permuting the cache nodes and the delivery phase signals by the same permutation 
does not change the W labels and the lower bound of the instance. 

Claim 4: Let P(T, ex, /3, L, N, I\) to be a problem instance and let n : [J\] —► [A'] to be a bijective mapping with inverse 
a. Assume that the problem instance P'[T ', a, /?, L ', N, K) is obtained from P under the following changes for all v £ T> and 
v' £C, 

• assume % p (v) = Zi, then set Z p (v) = Z n u\, 

• assume D p (v) = X dl! ... tdK , then set D p '(u) = X d<r(1)j ... id<r(Jtr 

then W p ' w (u) = W p ew (u), W P '( M ) = W p (u) for u £ T, and L’ = L. 

Proof: We note that 

R,ec(Zi, X ( p ..., idK ) IPfij R^c(Z 7 r(i') j X da ^ r .. }d(T ( K y ) 

for i = Therefore, for any v £ V and v' £ C, we have A p (v',v) = A p (v',v) and more generally A p (u,u) = 

A p (u,u). From this and that W p (u) = A p (ui,ui) U A p (u r ,u r ), we have W p (u) = W p (u) for any u £ T. Using eq. (jiji, 
we have W p ew (u ) = W p ew (u ) for all u £ T 7 . Consequently, it follows that L' = L. ■ 

Henceforth, we will assume w.l.o.g. that (3 = min(/3, K) and that Claim [ 3 ] holds. Our next lemma shows a structural property 
of problem instances. Namely for an instance where L < amin(/3, A'), increasing the number of files allows us to increase the 
value of L. This lemma is a key ingredient in our proof of the main theorem (the proof appears in the Appendix). 

Lemma 1: Let P = I’(P. a, 3, L. K. N) be an instance where L < a inin(/L K). Then, we can construct a new instance 
P' = P{T', a, 3, L', K,N + 1), where L' = L + 1. 
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Fig. 6: (a) Problem instance P'(T' , a, /3, L, N', K), (6) problem instance P(T, a, p, L, N, K) where a = 2, /3 = 2 and K = 2. Both 
instances reach L = amin(/3, A') = 4 with different number of files N = 3 and N' = 4. 


Informally, another property of optimal problem instances is that the same file is recovered as many times as possible at the 
same level of the tree. For instance, in Fig. [3] W\ is recovered in both T u *(i) and T u *( r ). In fact, intuitively it is clear that the 
same set of files can be reused in any subtrees of an internal node. Our next claim formalizes this intuition. Recall that for a 
node u, Tg — (tr) and r r — 

Claim 5: Consider an instance P = P(T, a, f3, L, K, N). For all nodes u £ T, suppose w.l.o.g. that |F*| > |r r |. Suppose 
that there exist a node u € T such that such that F r ^ I’;. Then there exists another instance P'(T'. a, 3, IJ, A". K) such that 
N' <N,L'> L, and F r C T t for all u £ V. 

Next, we upper bound the maximum value of | W new (u) | for a node u £ T. 

Claim 6: In instance P(T, a, /3, L, N, AT), consider an internal node u. Let p(u) = cii[min(/3 r ., K — /3 z)] + + a r [mm(/3i, K — 
f3 r )] + . We have 

\Wnew(u)\ < min (p(u), N — |T ; U T r |). 

Proof: From eq. ([5]) it follows that 

\W new (u)\ < |A rl (u) \ W(u)| + |A, r («) \ W(u)|. 

Next, we observe that 

|A rI (ti)\W(u)| = |i?ec(Z(u r )\Z(u,),D(u,))\W(u)| 

< |lD(w)| X I Z(u r ) \ Z(u/)I 
(“) 

< Oil X min(p r , K - pi), 

= on x [mm(/3 r ,K - /3i)] + , 

where inequality (a) holds, since |D(u;)| = cq and |Z(u r )\Z(tq)| < min(/3 r , K — fii). Inequality (b) holds under the conditions 
Pi = min(/3j, I\) and f3 r = min(/3 r , AT) (see Claim[s]in Appendix). We can bound |A; r (u) \ W(rt)| in a similar manner. 

To conclude the proof we note that instances Pi and P r recover a total of F/ U F. r sources. As the total number of sources 
is N, \W new (u)\ < W — |T; U r r |. ■ 

Definition 8: Saturation number. Consider an instance P*(T*, a, fi, L*, N*, K), where L* = a min(/?, K), such that for all 
problem instances of the form P(T, a, /?, A*, N, AT), we have N* < N. We call N* the saturation number of instances with 
parameters (a,(3,K) and denote it by N sat (ot,fi,K). 

In essence, for given a, ft and AT, saturated instances are most efficient in using the number of available files. It is easy to see 
that N sat (a, (3, K) < amin(/3, A') since one can constmct an instance with lower bound amin(/3, K) when amin(/3, A') < N 
(see Corollary [T}. 

Example 4: Consider the two problem instances P and P' with a = 2, /3 = 2 and K = 2 that are shown in Fig. [6] The 
lower bound for both instances is L = a min(/3, K) = 4. However, instance P uses one file less than P'. This reduction is 
accomplished by reusing file IT| at both T u , (zt_and T n - ( r ). The instance P' can be treated as trivial instance constructed by the 
procedure suggested in the proof of Corollary TTI as it uses N' = a min(/3, AT) = 4 files. It can be verified by exhaustive search 
that P is one of the problem instances associated with N sat ( 2,2,2); therefore, N sat ( 2,2,2) = 3. 

Definition 9: Atomic problem instance. For a given optimal problem instance P(T. a. B. /.. N. K) it is possible that there 
exist other optimal problem instances Pfion , ft j, Li, N, K),i = 1,..., m with m > 2 such that a * = S”=i ft = ft anc l 

1 Li = L, i.e., the value of L follows from appropriately combining smaller problems. In this case we call the instance P 
as non-atomic. Conversely, if such smaller problem instances do not exist, we call P an atomic problem instance. 
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{Zx} {Xi 23 } {Z 2 } {X 312 } { Z 3 } {X 231 } 



{Z 2 } {X 3 i 2 } {Z 3 } {X 23 i} 



Pi (71 ,3,3,5, N, K) P 2 (T 2 , 1,1,1,N, K) P 3 (T 3 , 2,2,4, N, K) 

Fig. 7: Problem instances with N = K = 3. Instance Pi is non-atomic as the corresponding lower bound can be obtained by summing the 
lower bounds from P 2 and P 3 . 


Example 5: Consider the problem instance Pi shown in Fig. [7] with N = K = 3. The lower bound associated with this 
instance, 3 R* + 3 M > 5, can be obtained by combining the lower bounds acquired by P 2 and P 3 . Specifically, instance P 2 
yields R* + M > 1 and instance P 3 yields 2 R* + 2 M > 4. Note that in P\ the last edge (u*,v*) is such that W new (u*) = 0. 
Thus, the tree can be split in two separate instances at u*. Thus it is non-atomic. 

It is evident that instances where no new file is recovered in the last edge are non-atomic. However, we emphasize that there 
are other instances that are non-atomic as well. For example, consider instance P[, obtained from Pi where we change the label 
©( 113 ) to X 22 i. In P[, the labels of edges ( 124 , 12 *) and (u*,v*) will change to {W 2 } and {IT 3 } respectively; none of the other 
labels will change. Even though W new {u*) is nonempty in P[, but we still call it non-atomic since the associated lower bound 
does not change. 

The following theorem and its corollary are the main results of our paper and can be used to identify optimal problem instances. 


Theorem 1: Suppose that there exists an optimal and atomic problem instance P 0 (T = {V. A), a. 3. L 0 , N. K). Then, there 
exists an optimal and atomic problem instance P*(T* = (V*, ,4"), o:, /3, L*,N, K) where L* = L a with the following properties. 


Let us denote the last edge in P* 
Then, we have 


with (u* ,v*). Let P* = P{T*. {l) , a,, fa,L \, |F,|, K) and P r * = P(J*, (r) ,a r) p r ,L* r ,\T r \,K). 


u* (r) ’ 


L* = ai min(/3i, K), 

L* = a r min(/3 r , A'), and 

L* = min (a min(/3, K), L* + L* + N — No ), (9) 

where N 0 = ma x(N sat (ai, fa, K), N sat (a r , /3 r , K)^ Lurthermore, min (0i,l3 r ) < K. 

Proof: Note that we assume that the problem instance P Q is atomic. This implies that W J n '" w (u *) 7 ^ 0 and, consequently, 
N > (T/l, |r r |. Using Claim[3]we can assert that /3 1 = min (Pi,K) and /3 r = min(/3 r , K). 

We denote by (u*,v*), the last edge in P a . We let Pi = P(T u *(i), oq, Pu U, jr ; |, A') and P r = P{T u »(r),at r ,P r ,L r , |FJ, iT). 
It is easy to see that L 0 = Li + L r + \ W^° w {u*)\. Suppose that L; < ai min(^;, K). We apply the result of Lemma ll] by 
noting that |F; | < N, and conclude that there exists another instance P ; ** = P(Tff^,ai,/3i, L* + 1, |F; | +1, K) that can replace 
Pi, where the new file is denoted W*. We also note that in P a , W* £ Wp‘ w (u*)■ Let us denote the new instance P' Q . We 
emphasize that the nature of the modification in Lemma [l] is such that A p °(u*,u*) = A p °(u*,u*). Moreover, we note that 
W p o(u*) = W p °(m*) U {W*}. Thus, 

WnL{u*) = A p °(u*,u*)\W p °(u*) 

= A p '°(u*,u*) \ W p °(u*) U{W*} 

= W p ° w (u*)\{W*}. 

The problem instance P' Q is also optimal since L\ is increased by one and \ W p ° w [u*)\ is decreased by one, leaving L a unchanged. 
Therefore, moving files from W p ° w (u*) to either Pi or P r preserves optimality. In addition, from L' a = L 0 and that P a is 
atomic, P' is atomic. Based on this argument, we can immediately conclude that we cannot have L/ < ai min(/3;, K) and L r 
a r min(/3 T ., K) as the file W* can be used to simultaneously modify the instance P r . Upon this modification, we can conclude 
that L 0 can be increased by one, which contradicts the optimality of the instance P a . Thus we assume that L r = a r min(/3 r , K). 


2 As the instance is atomic, we have N > Nq. 
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Fig. 8: Comparison of the proposed lower bound and the cutset bound. 


We can repeatedly apply the operation of moving files from W^° w [u*) to Pi until we have L*[ = a; min( J 8;, K). It has to be 
the case that \W^° w {u*)\ > op min(/3z, K) — |F;| so that we can repeatedly apply the operation of moving the files, for if this 
were not true, the instance P a would not be atomic. 

We will denote the instance that we arrive at after completing these modification by P* which is optimal and atomic. We 
can also observe at this point that if we have /3; > K and f3 r > K so that [3] = ,8 r = K, then W^* w (u*) = 0 (by Claim pb 
which implies that the original instance P a is not atomic. Thus, either I3i or !3 r or both have to be strictly smaller than K. In 
the discussion below we assume w.l.o.g. that /3 r < K. It is easy to see that 

L*=Lt+L* r + \w£ w (u*)\. 

We define p(u*) = cni x [min(/3 r ,iC — f3i)] + + a r x [min(/3 i,K — /3 r )] + where p(u*) > p(u*) due to the fact that ai > oq 
and a r > a r . Using this and Claim |6j we have that 

\ W new( u *)\ < min (p(u*), N — max(|r*|, |r*|)). 

For an optimal instance, we claim that the above inequality is met with equality. If L* = a min(/3, K) there is nothing to prove. 
In this case, \W^ ew {u*)\ = amm(/3,K) — L* — L* = p{u*) (see Claim [ 9 ] in Appendix) and the above inequality is met with 
equality. 

Otherwise, we have L* < amin(/3, K) which implies p{u*) > \W^ ew (u*)\ and p(u*) > N — max(|r*|, |F*|). From the 
Claimj5j we can assume that either C F* or T* C F*. In P*, N used = max(|r*|, |F*|) + (u*) files are used so far. 

Now, if N > N use d, we can use Lemma [I] to conclude that there exists a problem instance P"(T" ,a, [3, L", N" : K) where 
N" = N usedj + 1 < N and L" = L* + 1. This is a contradiction since we assumed that P* is optimal. Therefore, N < N use d- 
In addition, since the number of available files is N thus N > N US ed- As a result, N = N used = max(|r^|, |r*|) + \W£* w (u*)\ 
and the inequality is met with equality. In both cases, we conclude that 

\ W nLi u *)\ = min {p{u*), N — max(|r* |, |r*Q). 

It follows that 


L* = min (a min(/3, I \), L'f + L* + N — max(|r*|, |r*|)). 

If L* = a min(/3. K) the saturated instance associated to N sat (a, fi, K) is an optimal instance. Otherwise, L* < amin(/3,A'), 
we have 


l^(^)l = iv-max(|rrMr:i) 

<N- ma,x(N sat (ai,/3i, K), N sat (a r ,(3 r , K)). 


( 10 ) 


We claim that for P* to be optimal, and P* have to be such that max(|r*|, |F* |) = max(JV sat (ct;, 0i, K), N sa t(a r , f3 r , K )). To 
see this we proceed as follows. Note that by the definition of saturation number, there exist problem instances P[{T {, ui , T/, LJ, Nj. K) 
and P' r {T'.. a r , /3 r , L' r , N' rl K) such that L[ = L *, L' r = L*, Nj = N sat (ai,j3i, K) and N' r = N sat (a r , f3 r . K). W.l.o.g. let assume 
N[ > N' r . By the Claims pi and pi problem instances P[ and P' r can be modified in such a way that f3[ = minfri/, K), B' r = 
min(/3 r , K) and TJ C F^.Wlso, By Claim |4j we can set U„ G c'Z(t;) = {Zi,...,Z^,} and U„ 6 c;^(t') = {Z K _^, +1 ,..., Zk}- 


This ensures that f3i — min (f3i,K), (3 r = min (f3 r ,K), and f3 = min(/3, K) hold in the defined problem instance. Now, consider 
the problem instance P' = P(T / , a, (3, LI , N, K ) with last edge (V, v') where Pf and P' r are instances of u[ and u' r respectively. 
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The instance P' uses N[ + \W^ ew (u')\ files. If N—N[ — \W^ ew [u’)\ > 1, then we are able to apply Lemma[I]./V— N[ — \W^ ew (u')\ 
times and come up with a modified version of P' so that either L' = a min(/3. K) or N — N[ — \Wff w (u')\ = 0. The first 
case cannot happen since by assumption P* is optimal and V < L* < a min (/3,K). Therefore, \W^ ew {u')\ = N — N[ and 
L' = L* + L* + N - N[. Finally, as L' < L* and L* < L* + L* + N - N[, we conclude that L' = L*. ■ 

Corollary 2: Suppose that there exists an optimal and atomic problem instance P 0 {T = ( V. A), a. f3 , L„. N. K). Consider 
problem instances P[(a[, /3[, L[, N, K) and P/(a[, j3' r . L ' r , N, K ) such that a[ + a' r = a and /3( + f3' r = (3 such that N > N ' 0 = 
msx[N sa t{a' v P[, K), N sat (a' r , /3' r , K)). Then we have 


L 0 > min (amin(/3, K ), L[ + L' r + N — N' 0 )). 

Proof: The result follows by applying the arguments in the proof of Theorem [T| to the problem instance where If and P* 
are replaced by P[ and I f respectively. ■ 

The following example demonstrates the effectiveness of Corollary [2] 

Example 6 : Consider a system with N = 64, K = 12 and cache size M = 16/3. The cut-set bound for such a system provides 
a lower bound R*(M ) > 77/27 = 2.852. Now, using the approach of Theorem [l] for a = 12, /3 = 8, (ai,j3i) = (a r ,/3 r ) = (6,4) 
yields 12P* + 8 M > min(12 x 8, 24 + 24 + 64 — N sat ( 6,4,12)). It can be shown that N sat (6 ,4,12) < 17 (see Algorithm [3] 
below). Therefore, R*(M ) > 157/36 = 4.361. This is significantly closer to the achievable rate of 5.5 (from j9]). 

Theorem [T] can be leveraged effectively if it can also yield the optimal values of a/, 3/ and a r ,/3 r . However, currently we 
do not have an algorithm for picking them in an optimal manner. Moreover, we also do not have an algorithm for finding 
N sat (a, /3, K ). Thus, we h ave to use Corollary [2] with an appropriate upper bound on N sat (a. 3. K) in general. 

Algorithm [3] in Section pi I-A provides a constructive algorithm for upper bounding N sot (a. 8 . K). Setting ai = [a/2], 
8 / = \fj 2] in Theorem [I] and applying this approach to upper bound the saturation number, we can obtain the results plotted 
in Fig. [8] 


A. An analytic bound on the saturation number 

Recall that the saturation number for a given a, j3 and K is the minimum value of N such that there exists a problem instance 
P(T,a, /3, L, N, K) with L = amm(/3, K). In particular, this implies that if we are able to construct a problem instance with 
N 1 files with a lower bound equal to a inm(/3, K), then, N sat (a, /3, K) < N'. In Algorithm [ 3 ] we create one such problem 
instance. 

The basic idea of Algorithm [3] is as follows. The first part focuses on the construction of the tree, without labeling the leaves. 
For a given a and /?, we first initialize a tree that just consists of a single edge (u*,v*). Following this, we partition a into 
two parts ai = [a/2] and a r = a — a;. On the other hand, /I is split into f3i = [/3/2J and f3 r = j3 — j3i. The algorithm, then 
recursively constructs the left and right subtrees of it*. It is important to note that the split in the (a, 8 ) pair is done in such a 
manner that each subtree gets the floor and the ceiling of the one of the quantities. Moreover, the labeling of the cache node 
leaves is such that for a given node it, |Z(it() (T Z(it r )| is as small as possible. The underlying reason for such a labeling is to 
ensure that the condition of Claim [3] doesn’t hold for any u G T. 

Following, the construction of the tree, the second phase of the algorithm labels each of the delivery phase nodes, so that the 
computed lower bound is L = a/3. In this step we use N = a/3 files (see the procedure discussed in the proof of Corollary [TJ. 
In the third and final phase of the algorithm we modify the instance so that for any node u £ T, we have that either 1’/ C F r 
or r r C F/; we use Claim [5 to achieve this. In the beginning all recovered files in the constructed instance are distinct so that 
T{ui)r\T(u r ) = 0 for all nodes u. W.l.o.g. assume that |r(u r )| < |r(iq)|. An application of Claim[5]will thus cause a significant 
reduction in the number of files that are used. The following lemma quantifies this reduction. 

Lemma 2: For given a, /3 and K if /3 < K then. 


N sat (a,p,K) < 


2a/3 -{- a T /3 
3 


Proof: We use Algorithm [ 3 ] to generate problem instance P(T, a, /3, L, N sat: K) so that L = a/3. By the definition of the 
saturation number we have N sat (a, /3, K) < N sat hence we just need to show that N sat < 2a ^+ a +^ . 

First, we need to show that L = a/3. By line|32lof the algorithm the file l / l , /_j )s+r is recoverable in instance If by the 
pair (B(u t ), Z(u Q + r )) or equivalently A(vt, v a +r T= 3 +r for 1 < t < a and 1 < r < /3. On the other hand, W(v*) = 

Uf =1 uf =1 A(v t , v a+r ) therefore W(u*) = {IT/,..., W a/3 }. Recall that W{v*) = C u&To W new {u) and L 0 = J2 u <bT 0 \ W new(u)\ 
so we have Lq > |W(rC)| = a/3. But Lq < a/3, by Corollary |2j therefore Lq = a/3. In phase III of the Algorithm (Modify 
Delivery Phase Signals) using Claim [5] we have L > L$ and since L < a/3 and Lq = a/3 thus L = a/3. 

W.l.o.g we set left incoming node such that r(zi r ) C r(iq). Starting from the root node v *, we let the set {zio, Ui,..., u t } and 
{u>o,..., tut_i} to be the left and right incoming nodes respectively so that Ui is topologically higher than Uj for i < j, ut = u* 
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Algorithm 3 Instance construction for upper bounding N sat (a, 0, K) 

Input: a, /3 and K. 

1: Initialization 

2: Let (u*, v*) be last edge and set Unew = {w*}. 

3: Set Z(u*) = {Zi,Z 2 ,..., ^mm(/ 3 ,K)} and b(u*) = f), a(u*) = a. 

4: C = 0 and V = 0. 

5: end Initialization 

6: procedure Tree Construction & Cache nodes labeling 
7: while Unew is nonempty do 

8: Pick u £ Unew, create nodes ui and u r , edges ( ui,u ) and (u r ,u), add them to 7o- 

9: Set a(ui) = \a(u)/ 2], b{ui) = [b(u)/2\ and a(u r ) = a(u) — a(ui), b(u r ) = b(u) — b(ui). 

10: Set Z(tii) and Z(tt r ) be subsets of Z(u) of sizes min (b(ui),K) and min(6(w r ), K) respectively with minimum intersection. 

11: Remove u from Unew 

12: if a{u{) + b{ui) > 2 then 

13: Add Ul to Unew • 

14: else 

15: If b(ui) == 1 add ui to V otherwise to C. 

16: end if 

17: if a(u r ) + b(u r ) > 2 then 

18: Add U r to Unew 

19: else 

20: If b(u r ) == 1 add u r to V otherwise to C. 

21: end if 

22: end while 

23: end procedure 

24: procedure Delivery nodes labeling 
25: Let V = {ui,. . . , v a }. 

26: for r = 1,..., min(/?, K) do 

27: Pick a node v G C with Z(u) = { Z r } and denote it by v r + a - 

28: end for 

29: Let C \ , . . . , tta+min(/3,jr) } {Va-J-minf/t.iri + l) ■ ■ ■ j . 

30: for t = 1,..., a do 

31: for r = 1,..., min(/3, K) do 

32: d r = (t — 1) min(/3, K) + r. 

33: end for 

34: for r = min(/3, K) + 1,..., K do 

35: d r = 1. 

36: end for 

37: SetB(vt)mX du ...' dK 

38: end for 

39: end procedure 

40: procedure Modify Delivery phase signals 

41: Denote current instance by Po(7o, a, f3, L(uNq. K). 

42: Modify Po(To,a, /3, Lo,No, K) by Claim [ 5 ] to obtain P(T, a, /3, L , N sa t, K). 

43: end procedure 

Output: N sat (a, /3, K) = |r(u*)|, P(T, a, /3, L, N sat , K). 


and uo to be a leaf. This is depicted in Fig. [ 9 ] Recall that r(zt) = W new (u) Lir(it;) ur(u r ) and W new (u) (T (r(uj) U r(it r )) = 0 
for any u G T. Therefore, recursively we have, 

N sat = |r(u*)| = \T(u t )\, 

= \W new (u t )\ + |T(u t _i)|, 

t 

= Y, \ W ne W {Ui)\, (ID 

i—1 

where we used W new (uo) = 0 since u 0 is a leaf. 

In Algorithm [5] a(u) and b(u) denote the number of delivery phase nodes and the number cache nodes, respectively in the 
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subtree rooted at u. Note that by definition, we have 

L = \W new (u t )\ + ^2 I W new (u)\+ ^2 \W new (u)\. 

U ^T U t _i uGT Wt _ 1 

Using Corollary |2l we conclude that J2 u &T Ut _ 1 \ W new(u)\ < a(u t _i)&(u t _i) and J2 u <ET Wt _ 1 I W new (u)\ < a(tu t _i)6(iu t _i). 
Similarly, using Claim [6] we have that |W ne i«(wt)| < a(ut-i)b(wt-i) + a(wt-\)b(ut-\). In fact, all these inequalities are met 
with equality. This can be seen as follows. An application of Claim [5] does not change the lower bound, which implies that 
L = a(3 = a{ut)b[ut). But, a(ut) = a(ut- i) + a(wt- i) and b(ut) = b[u t -i) + b(wt-i) so that 

L = a(u t -i)b(w t -i) + a(w t -i)b(u t -i) + a(u t -i)b{u t -i) + a(w t -i)b(w t -i). 

An inductive argument can be made to show a similar result for Ui, i = 1,..., t — 1. 

Using these results and the equality in CD yields, 

a/3 = L , 

^ ' I new (tt) | - 

ueT 

t t -1 


^2 | 'Wnew (^) | j 
2 — 0 2=0 ueTwi 

t -1 

Nsat + ^2 > 


2=0 
t- 1 


N sat = a/3 - y^ y a(wi)b(wi). 


( 12 ) 


i =0 


Considering our setting for a(u) and b(u) in the line [9] of Algorithm [3] we have 

a(u t+ 1 ) = a{ui) + a(wi), b(u i+1 ) = b(ui) + b(wi), (13) 

for 0 < i < t- 1 and either ( a(ui),b(ui )) = (|'a(u i+ i)/2], [b(u l+ i)/2j) or (a(tij), 6(u»)) = (La(u i+ i)/2j, \b(u i+1 )/2 ]). In 

any case using eq. (13 i we have 

a{ui) < |”a(ttj+i)/2], 

a(u i+ 1) + 1 


< 


a(zti) + a(txii) + 1 


a(ui) < a(wi) + 1. 


By a similar argument we have b(ui) < b(wi) + 1. Using eq. (131 recursively, it is easy to see that a = a(uo) + Y2i=o a(wi) 
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Fig. 9: Saturation path 


and /3 = b(uo) + o b( w i)- Therefore, using eq. (12 1 and (111, 


Nf,nt — 


a/3 - YX*>i)b(wi), 
i =o 

t—i 

= ^2 ( a ( u i)H w i) + a(wi)b(ui)) , 

i =0 
t -1 

< ^2 ( l a ( w i ) + + a(^)[6(^) + 1]), 

2=0 

t-1 

< E ( 2 a(wi)b(wi) + a(wi) + b(wi)), 

i —o 

t-i 

a + [3 + 2 ^ a(wi)b(wi), 


< 


=> ^ ~2a(wi)b(wi ) > 


i=0 

a/3 — a — /3 


*=o 


Finally, using the above inequality and eq. ( [T2| , we have 

N sat (a,/3,K) < N sat , 

t -1 

= a/3 - y^ y a(wj)l3(wi), 


i=0 


< a/3 — 

Furthermore as N sat (a, f3, K) is an integer we conclude that 

N sat (a,P,K) < 


a/3 — a — /3 2a^ + a + , 


2 a/3 + a + /3 


The aforementioned proposed upper bound on the saturation number is tight. To see this, let consider f3 = 1. It is easy to see 
that N sa t (a, 1 ,K) = a and using Lemma [2] we have N sat < [a + 1/3J = a. 


IV. Multiplicative Gap between upper and lower bounds 
We now show that for any set of problem parameters, our proposed lower bound and the achievable rate of [9j| in eq. 0 are 
within a factor of four, i.e., we show the following result. 

Theorem 2: Consider a coded caching system with N files and K users each with a normalized cache size M. Then, 


7 (M) 


R C {M) 

R*(M) 


<4, 


for 0 < M < N. 
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The key idea i n proving this result is to exploit the analytical upper bound on the saturation number N sat (a, 0, K) proposed 
in Section III-A For a given TV and A, we consider three distinct regions of M. For each range, an appropriate (a, 0) pair 


allows us obtain a lower bound on the rate that is within a factor of four of the achievable rate. 

Proof: 

We use Corollary [2] with the 2a and 28, so that P[ and //' have parameters a and /3.This gives us the following lower bound. 

2aR*(M) + 28M > min (2a min(2/3, K), 2a0 + [TV - TV 0 ]+) , 

Moreover, we restrict 2/3 < K so that. 


2 aR*(M) + 2/3M > min (4a/3, 2a0 + TV - TV 0 ) 

TV- N 0 


R*(M ) > min 2/3,/3 + 


2a 


- -M. 


( 14 ) 


Our first observation is that for min(TV, K) < 4, the bound is easily seen to be true. Towards this end, by setting a = N, 0 = 1 
in ( fl4l ), we obtain 

M 

R*(M) > 1 — —. 

where we used N sat (N, 1, I\) = TV. Furthermore, from eq. (|2|, 

R C (M) < m in (N, I<) (1 - M/TV), 

This means that 'y(M) = min(TV, K ) < 4 for min(TV, K ) < 4. 

Thus, in the subsequent discussion, we only consider min (TV, K) > 5. As in 0, we divide the M- axis to three separated 
regions. For given M, we explore the space of (a, /3) pairs to obtain an appropriate lower bound that allows us to show the 
multiplicative gap of four. 


A. Region I: 0 < M < max(l, 7V/A') 

First, we consider the range 0 < M < 1. In eq. (14 1 we set a = 1, /3 = [mm(N, K)/2\. By such a setting we have 
2 8 < min (A, K ) < K and N > N sat ( 1, p, K) = /3. Therefore for M < 1, 

R*(M) > min ^2/3, - PM 

(“) . ( R TV-/3 

> mm p,- 


W . / min(A^, K) — 1 N — min(iV, K)/2 
> mm^ g ’ 2 

W . / min(TV, K) — 1 min(A r , K) 

~ mm ^ 2 ’ 4 

(4) min (TV, K ) 


> 


min(TV, A')(l — M/TV) 


> Rc(m)/a. 

Here, (a) holds since M < 1, (6) holds since (min(TV, K) — l)/2 < /3 < min(TV, K)/2, (c) holds since TV > min (TV, K), and 
(d) holds since min(TV, K) > 2. 

Next, consider the range M £ \ 1, N/ K]. Note that we only need to consider the scenario where TV > K. The achievable rate 
R C (M) in this interval is upper bounded by the convex combination of the rates A c (0) and R C (N/K) so that 

R C (M) < AR C (N/K) + (1 - X)R c (0) = A(1 - A/2) - A/2, 

where A = AM/TV. Now, we set a = \N/K],8 = L A 7 2 J so that a P < (TV/A + l)A/2 = N/2 + A/2 < TV. As, 
N sa t(a, /3, K) < a/3, this means that TV > N sat (a, 0, K). In addition, note that 20 < K. Therefore, we can use eq. (14 1 to 
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obtain 


R*(M) > min (2/3, /3 + — — N *<*( a 'P' K ) 1 _ @- M) 
[ 2a J a 


(a 


> min <2/3(1 


M\ 2/3 N — 2/3M /3 


> min 


• K 
> mm < — 
2 


(c) 

> min 


{(«■-!) (l- 


6a 


1 - 


2a ) ’ 3 ' 2a 

KM\ /3 3V-2/3M 

2N ) ’ 2 + 4N/K 

/\ 1 


A\ 0 


d- A ) + T-« 


(d) 

> min 


(e) 

> «c(M)/4, 


f R C (M) Kf X\ (1 — A) 1) 

l 2 ’ 2 ^ 2/ 4 6J ’ 

| R C (M) R C (M) (K- 3) ^ A^ 


(15) 


where in (a) we used Lemma [ 2 ] to bound N sat (a,/3,K), in (b) we used N — 2/3M > 0, 1 < a < N/K + 1 < 2 N/K, 
(K — l)/2 < /3, and in (c) we used /3 > (K — l)/2, A = KM/N and the expression for the upper bound on R C (M) 
above. Next, (d) holds because of the achievable rate bound and (e) holds since mm(N, K) > 5. Therefore, 7 (M) < 4 for 
M € [1 ,N/K\ and N > K. Thus, we conclude that we have 7 (M) < 4 for M € [0, max(l, N/K)]. 


B. Region II: max(l, N/K) < M < N/2 

For any M € [max(N/K , 1), JV/2] we define to = [KM/N\ so that toN/K < M < (to + 1 )N/K. Since M > N/K thus 
to > 1. Using eq. Q, it turns out that, 

R C (M) < R c (t 0 N/K), 

K t 0 
to + 1 fo + 1 ’ 

(«) K 1 
“ KM/N ~ 2’ 

_ N 1 
_ M ~ 2’ 


where (a) holds since to + 1 > KM/N and to > 1. 

Now, consider setting a = [2MJ and /3 = \_N/2M\. With this setting we have a > 2 (since M > 1), /3 > 1 (since M < N/2), 
and /3 < N/2M < K/2 (since M > N/K). Furthermore, since a/3 < 2M x N/2M = N and N sat (a, (3, K) < a/3 therefore 
N > N sat (a, /3, K). This together with 2/3 < K implies that such a setting is a valid setting to use (14 1 . Therefore, using 
Lemma [ 2 ] to bound N sat (a, /3, K), we have 

R*(M) > — ^ P 


■ roo 2/3 , N /3 l n 

mm{2/? ’ T + 2^ - to - 6 } 


-M. 


We claim that 2/3 > 2/3/3 + N/2a — (3/6a — 1/6 or equivalently 8a/3 + a + /3 > 3A/. This can be seen as follows. When, 
N/4 < M < N/2 we have a > N/2, /3 = 1, so that this holds. On the other hand when max(l, N/K) < M < N/ 4, we have 
a > 2 M - 1, f3 > N/2M - 1, so that 8a/3 + a + /3 > 8N - 7(N/2M + 2M) + 6. It can been seen that N/2M + 2 M < N/2 + 2 
for 1 < M < N/4 therefore 8a/3 + a + f3 > 9N/2 — 8 > 3 N for N > 6. For N = 5, the claim trivially holds since a > 2,/3 > 1 
so that 8a/3 + a + ^>19>3xiV = 15. 
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Thus, we have 


R*(M ) > 


(a) 

> 


( 6 ) 

> 

> 

> 


2/3 ( TV — 2/3M 
3 + 2 a 

7/3 ( TV-2/3M 

12 + AM 
TV /3 _ 1 

4M + 12 ~~ 6 ’ 
/V 1 

4M ~~ 12’ 

TV 1 

4M _ 8 

Rc{M) 

4 ’ 


1 

6 a 6 ’ 

1 

6 ’ 


where in (a) we used TV — 2/3 M > 0, a > 2 and a < 2 M and in (b) we used /3 > 1. Eventually, 7 (M) < 4 for max(TV//\, 1) < 
M < TV/2. 


C. Region III: TV/2 < M < TV 

Let To = so that M > t$N/K for M £ (TV/2, TV]. For any M £ (TV/2, TV] the convex combination of rate R c (toN/K) 

and R C (N) gives us R C (M) < XR c (toN/K) + (1 — X)R C (N) = XR c (toN/K) where M = Xt^N/KX- (1 — A)TV or equivalently 
A = (1 — M/N)/(l — to/K). According to this and eq. ([2]) we observe that, 

R C (M) < XR c (t 0 N/K), 

= (1 - M/TV) (K - fp) 

" (1 - t 0 /K) (t 0 + 1) ’ 

_ K(1 — M/N) 

(1 + to) 

(“) K( 1 - M/TV) 

“ TV 72 ’ 

= 2(1 - M/TV), 

where (a) holds since 1 + to = 1 + | K /2 1 > K/2. 

Now if we set a = TV and /3 = 1 in ( fT4) i we obtain 

R*(M) > 1 — M/TV 

> ^c(M) 

2 

This implies that 7 (M) < 2 < 4 for M € [TV/2, TV] and concludes the proof. ■ 


V. Lower bounds on the other variants of the coded caching problem 

In addition to the original coded caching problem there are many variants of the problem including coded caching with multiple 
requests p 2 ] , decentralized coded caching El and caching in device to device wireless networks (23). Our proposed strategy 
applies with minor changes for these problems. 


A. Caching in device to device wireless networks 

Wireless device to device (D2D) networks where communication is limited to be single-hop are studied in |[23). There are K 
users who are the nodes of the network. Each user has a cache of size M and TV files are stored across the different user caches. 
Thus, in this setting we necessarily have KM > TV. As in the coded caching problem there are placement and delivery phases. 
In the placement phase the caches are populated from a server; this phase does not depend on the user demands. The server then 
leaves the network. We let Z, represent the cache content of the T-th user. In the delivery phase each user requests a file and the 
remaining users are informed about this request. Based on the requests, each user broadcasts a signal so that all demands can be 
satisfied. We denote by d ^ the signal that is broadcasted in the delivery phase by the i-th user when the j-th user requests 

file dj £ [TV] for 1 < j < K. The delivery signal sent by each user is function of its cache content so that H(X^ (1k \Z/) = 0. 
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We also denote by -X’d 1) ... ) d K the set of signals sent by all the users, i.e., Xd u ,..,d K = {xj^ d ,... ^X^’ dK }. The rate of 
the signal that the /-th user sends in the delivery phase is denoted by R,,d, ,...,<i K (A/). We are interested in lower bounding the 
worst case rate that denoted by R*(M) = K maxi t d lt ...,d k Rj.d, d^ (M). 

The cut-set technique and Han’s inequality have been studied in [23] and |[24| respectively to establish lower bound on R*(M). 
The multiplicative gap established in |23J depends on M and is not constant, whereas )24) shows a gap of at most 8. 

The D2D setting is almost exactly the same as the coded caching setting studied in our work. Our technique for obtaining 
lower bounds is applicable here with essentially no change and we can use Theorem [I] and its corollary. Furthermore, since 
H{xf dK \Zi ) = 0 we can get lower bounds that are somewhat tighter. By treating X,^ ,...,d K as the delivery signal of the 
original coded caching problem, we can our lower bound to show that the multiplicative gap between the achievable rate in 
and our proposed lower bounds is at most 4. The proof is quite similar to that of Theorem [2] and is omitted. 


AK) 


B. Coded caching with multiple requests 

Coded caching with multiple requests is variation of the original problem in which each user requests l files from the server 
in the delivery phase. A straightforward achievable scheme in this setting is to apply the scheme of [9) l times. This problem is 
investigated in [22) where a new achievable scheme is proposed based on multiple groupcast index coding. Furthermore, J22) 
introduce a cut-set type lower bound and show that their scheme is within a multiplicative factor of 18 to the lower bound. In 
contrast, using our approach we can demonstrate a multiplicative gap of 4 for this problem as well. 

In this setting the only difference with respect to the original problem is that from a cache signal Z, and delivery signal X^,.. d K 
one can recover up to l distinct files.Thus, di is a vector of size l containing information about the l files requested by /-th user. 
Therefore, all statements we presented for the original problem are applicable here, bearing in mind that Rec{Zi,Xd 1 ,...,d K ) can 
be as large as l. For instance, an extension of eq. (l8]l gives us L < la min(/I, K). Similarly, the saturation number N sat (a , /?, K , l) 
is defined as the minimum N' among all problem instance P(T, a, 8 , L , N ', K, l) so that L = la min(/\, 0). It is easy to verify 
that N sat (a, /3, K,l) < la min(/3, K ) in a similar way. The following claim can be shown (we omit the proof as it very similar 
to the previous discussion). 

Claim 7: Consider a coded caching system with a server containing N files and K users. Each user has a cache of size M 
and demands l files in the delivery phase. The following lower bound holds for N > Nq where Nq = N sat (a. 3. K . 1), 


aR*(M) + (3M > min (2Zamin(/3, K), lamm((3,K) + {N — !Vo)/2)) ■ 

Similarly, an extension of the Lemma [2] holds so that N sat (a. 3. K, I.) < l(2a3 + a + /?)/3 for 3 < K. Exploiting this upper 
bound and Claim [7J we are able to show that the multiplicative gap of the straightforward achievable scheme and our lower 
bound is at most 4. Let R l c (M) = IR C (M) where R C (M) is defined in eq. 0. 

Theorem 3: Consider a coded caching system with a server containing A/ files and I\ users. Each user requests l files, and 
has a cache of size 0 < M < N. Then 


R l c (M) 

R*{M) 


< 4. 


Proof: We divide the M axis into three regions, 0 < M < ma x(l, N/K), max(l, N/K ) < M < N/2, and N/2 < M < N. 
In each region we show R l c (M)/R*(M) < 4 for any N and K. In the following proof, M = l plays the same role as M 1 in 
proof of Theorem [2] Before embarking on the proof, we note that we only need to analyze the gap for min(7V, IK) > 5. Note 
that the lower bounds of the original problem are also valid here. Indeed, if each user instead of requesting l distinct files request 
the same file l times then the problem will be equivalent to the original one. Now, in ( [14} if we set a = N and 8=1 then we 
get NR * + M > N, or equivalently R*{M) > (1 — M/N), which is applicable to the multiple request problem. Regarding that 
R l c (M) < mm(N,lK)(l — M/N), therefore R l c (M)/R*(M) < 4 for ( N,IK ) < 4. 

1) Region I: 0 < M < max(Z, N/K): For 0 < M < max(7, N/K), we first show that the result holds for M < l. Since we 
separately analyze the gap for M > N/2 we assume l < N/2 so that M < ma x(l,N/K) < N/2. We use result of the Cl aim [7 
with setting a = 1 and 8 = \jnva.(N/2l,K/2)\ where /3 > 1 from l < N/2. Following the exact same steps as in Section IV-A 
for M < 1, it turns out that R*(M) > min(7V, IK)/A > R l c (M)/4 for M < l. 

Now, we assume that l < M < max(l, N/K) which is nonempty if N/K > l. Therefore, we only need to analyze the gap 
for N > IK and l < M < N/K. In this range of M the convex combination of M = 0 and M = N/K is achievable so that 
R l c {M) < XR l c (N/K) + (l-X)R l c (0). From R l c ( 0) = IK and R l c (N/K) = l(K — l)/2 we have R l c {M) < IK(\ - A/2) - IX/2 
where A = KM/N. By setting a = \N/IK~\ and 8 — we have a 8 < aK/2 < N 72Z + K/2 < N/l (from IK < N) 

and that N sot (a, 8, K, l) < la .8 < N. This ensures that the setting is valid for using Claim [7] According to Claim [7] for such a 








22 


setting we have. 


R*(M) > min 2 1/3,1/3 + 


N - N sat {a,P,K,l) 
2 a 


P M 
a 


^ (IK ( _A\ lK{l-X/2) _!£-X)_l_ 
2 v 2 r 2 4 6 


(b) 

> min 


= mm 


R C (M) IK( 1 — A/2) /(I — A/2) 1(1 - A) / 

2 ’ 4 + 2 4 6 

R C (M) IK(1 - A/2) / 

2 ’ 4 + 12, 


> min ( Rc(M) R C (M) ^ > R C {M) 


where inequality (a) can be obtained by making the same argument as we made in first five lines of eq. © and (b) from K > 2. 
2) Region II: max(/, N/K) < M < N/2: In the first step, we try to get an upper bound on the achievable rate. Letting 


to = \KM/N\ and following the argument we made in Section 


IV-B 


gives us R l c {M) < IR C (M) < l ( N/M — 1/2) for M in 


this range. Next, by setting a = [2M/l\ and /? = \_N/2M\ we have N sat (a, p, K,l) < la/3 < N and /? < 2 N/M < K/2 by 
M > N/K which imply that the constraints of the Claim [7] are satisfied. Therefore, 


R* 


> min ( 2 ip, Ip + 


N — N sat (a, P, K, l) \ pM 


> min ( 2 lp ( 1 — /-/— 1 , -2L_ 


(b) 

> 



2 a 


M \ 

HP , 

N - 2PM 

2 la ) 

’ 12 + 

2 a 

M \ 

lip , 

N - 2 PM 

2 M ) 

’ 12 + 

4M/1 


a 

l 

" 6 
l 

~ 6 


(c) 

> 


Nl Nl 

4M’ 4 M 


12 


> K(M)/ 4 , 


where in (a) we used upper bound on N sat (a, P, K,l) and that p/a < P/2 (from a > 2), in (b) we used N — 2/3M > 0, 
a < 2 M/l, and a > 2 M/l — 1 > M/l (from M < l). In (c) we used p > K/ 4 (for I\ > 2) and p > 1 (from M < N/2). 

3) Region III: N/2 < M < N: Using the same argument we made in Section |IV-C the achievable rate is bounded by 
R l c (M) < IR C (M) < 2/(1 — M/N). According to Claim [ 7 ] by setting a = [N/l\ and p = 1 one may not recover all 
N files since al < N, but if we increase a to [IV//] then all files will be recovered. Therefore aR*(M) + M > N or 
equivalently R*(M) > (N — M)/a. From N — M > 0 and that a < N/l + 1 < 2N/1 (since / < N) it turns out that 
R*(M) > /(I - M/N)/2 > AR l c (M) for N/2 < M < N. This concludes the proof. ■ 


C. Decentralized coded caching 

In the original coded caching problem the placement phase is managed by a central server. However, in many scenarios such 
coordinated placement phase may be impractical. Instead, a decentralized placement phase was investigated in 03 where the 
users cache random subsets of the bits of each file while respecting the cache size constraint. Even in this setting a multiplicative 
gap of 12 to the cut-set lower bound was obtained. Note that the lower bounds established for the centralized coded caching 
problem are also applicable to the decentralized case. By similar techniques to those used in proof of Theorem [2] we can establish 
a multiplicative gap of 4. The proof is omitted as it is quite similar. 

VI. Comparison with existing results 

Lower bounds on the coding caching rate have been proposed in independent work as well. In this section we compare our 
lower bounds with other approaches. 

A. Comparison with cutset bound 

Our first observation is that the cutset bound in [9) is a special case of the bound in eq. 0. In particular, suppose that 
a = [IV/s], p = s for s = 1,..., min(/V, K). In this case, we have aP < N. Thus, it is easy to construct a problem instance 
where L = aP (see Corollary [I]). This also follows from observing that N sat (a, P, K) < aP. 
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Our bound allows us to explore a larger range of (a, j3) pairs that in turn lead to better lower bounds on R*. Suppose that for 
a coded caching system with TV files and I\ users, we first apply the cutset bound with certain a± and j5-\ such that a j 3-\ < TV. 
This would result in the inequality 

aiR* + p\M > aif3\. 

However, our approach can do strictly better. To see this note that a\Pi < TV implies that N sat (ai, j3i, K) < TV. Now, using 
Corollary [2] we can instead attempt to lower bound 2on R* + 2/3\M and obtain the following inequality. 

2«i R* + 2/3\M > min (4ai/3i, 2 aifii + TV - N sat (ai, /3i, K)) 

=> aii?* +fi\M > min(2aiy0i, ai/3i + (TV - TV snt (ai, /3i, K))/2 ), 

which is strictly better than the cutset bound since TV — N sat {o.\,P\,K) > 0. 

Example 7: Consider a system containing a server with four files and three users, TV == 4 and I\ = 3. The cutset bounds 
corresponding to the given system are 

4 R* + M> 4, 

2R* + 2M > 4, and 
R* + 3 M > 3. 

A simple calculation shows that if M = 1, the above inequalities, yield the lower bound R* > 1. 

Now, consider the second bound, 2R* + 2 M > 4 and instead attempt to obtain a lower bound on 4 R* + 4 M. In this case 
by exhaustive enumeration, it can be verified that N sat ( 2,2,3) = 3 < TV. Using Corollary^] this results in the lower bound 
L* > min(4 x 3,2x4 + 4 — N sat (2 ,2,3)) = 9. Thus we can conclude R* + M > 2.25 which is better than the cutset bound 
R* + M > 2. Moreover, this inequality also yields a better lower bound R* > 1.25. 


B. Comparison with lower bound oim 
The authors in 03 use Han’s inequality 


Theorem 17.6.1] to establish the following lower bounds on the coded caching 

E r sr - oi+ _ r ;\r _ 7>'i + 


(16) 


problem. 

aR*(M) + /3 M>N -^ [TV - a/3] + - [TV - aK 

where p = min(|~ jV ~ ct/3 ], K — /?), /3 £ {1,... ,A'} and a £ {1,..., This bound also provides more flexibility in the 

choice of a as compared to the cutset bound. 

An analytical comparison between our bound and the bound in inequality ( [T6| is hard, especially since a priori in all these 
bounds, for a given M, it is unclear which particular (a, j3) pair gives the best lower bound. Thus, in the discussion below we 


attempt to analytically compare the bounds for given (a,/ 3). We also present a numerical comparison in Section VI-E 

(a) Our bound is superior, when l/a + l//3 < 0.4, i.e., when the values of a and /3 are large enough. Note that the best lower 
bounds on R*(M ) for systems with TV and K reasonably large are obtained for higher values of a and /3. Thus, for most 
parameter ranges our bounds are better. 

(b) The bound in [TO:] is better when a = 1 and TV < K. This in turn means that their corresponding lower bound for small 
values of M is better than ours. 

(c) We can demonstrate that our proposed lower bound is within a factor of four of the achievable rate, whereas 03 only 
demonstrates a multiplicative gap of eight. 

In the remainder of this discussion we assume that a > 2 and show these claims. Let L* denote the value of our lower bound 

and let L u denote the lower bound of [ K)|. 

Case 1: a/3 > TV. 

Note that a < \N//3~\ in inequality ( fl~6| >. Furthermore, a > 2 implies that TV > /3. Thus, we can conclude that a/3 < fTV//T] 3 < 

2TV. Now, we use Corollary Ul to compare the bounds. Specifically, set ai = \a/2~\,/3i = |_/3/2J ,a r = \a/2\ and f3 r = \j3/2~\. 

This implies that 

a(3 

ma x(ai/3i,a r l3r) < — < TV. 

Thus, we obtain L* = min (a/3, aifii + a r /3 r + TV — TVq). Note that 


N 0 = max {N sat (ati, Pi, K), N sat (a r , /3 r , K )) < max(a jy 8j, a r /3 r ) < TV. (from above) 
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Thus, 

L* = min {a/3, ai/3i + a r /3 r + N — N 0 } 

> min{a/3, ai/3i + a r /3 r + N — max(cq/3;, a r /3 r )} 
= min{a/3, min (cq/3j, a r /3 r ) + N} 

> N. 

On the other hand note that Lh is at most N. Thus, our bound is strictly better. 

Case 2(a): a/3 < aK < N. 


As N > a/3 > N sat (a, f3, K) we use (14 1 to obtain 


L* = min (amin(A', 2/3), a/3 + (N 


No)/2). 


The corresponding bound Lh is obtained by setting //, = K — (3. 

L h = aK - (1 - /3/K)(N - a/3) 

= a/3( 1 + 1/x — x) — (1 — x)N, (where 0 < x = /3/K < 1) 

< a/3(2 — x), (since, N > aK = a/3/x). 

Thus, we conclude that Lh < min(aA/ a/3(2 — x)) < a min(AT, 2/3). As a result, we only need to examine whether a/3 + (N — 
Nq)/2 > Lh- Now, using the fact that Nq < (2 a/3 + a + (3)/ 3, we have that L* > Lh when 


2a/3/2> + N/2 - (a + /3)/6 > a/3(l + 1/x - x) - (1 - x)N 
(3/2 - x)N - (1/x + 1/3 - x)a/3 - (a + /3 )/6 > 0. 


As N > aK = a/3/x, inequality (17 1 certainly holds if 

(l/2x + x- 4/3)a/3 - (a + /3)/6 > 0. 


(17) 


It can be verified that 1 /2x + x — 4/3 > y/2 — 4/3 > 1/15 for 0 < x < 1, so that the above inequality will definitely hold if 
0.4 > 1/a + l/p which is the case for a,/3>5. 

Case 2(b): a/3 < N < aK. 

In this case fi = \N/a — /3~\, so that 

L h < N - (1 - a(3/N)(N - a/3) 

= q/3(2 — x') (where 0 < x' = a/3/N < 1) 


As in the previous case, we conclude that L* > Lh if 

2o/?/3 + N/2 - (a + /?)/6 > a/3(2 - x'). 

Upon analysis similar to the previous case, we can conclude that our bound is better when 0.4 > l/o + 1//3. 

C. Comparison with lower bound 

The work of ED is closest in spirit to our proposed lower bound. In particular, we show that their lower bound corresponds 
to specific problem instance as defined in our work. We note however that the work of ED does not analyze the multiplicative 
gaps between the achievable rates and lower bounds. The lower bounds in ED can be rewritten as 


2mR* + 2tmM > Lq, for t < N, K > 2 (18) 

2 tmR* + 2rnM > L 0i for t < N, K > 21, 


where Lq = min{4fm 2 , 2 tm 2 + N — No}, 3Vo = t(m 2 — m + 1), m = n — 7 and n = [~(f + yjt 2 + 12 t(N — f))/ 6 f]. Also, 
7 = max(0, \n — K/2t\) and 7 = max(0, \n — AT/2]) in the hrst and second lower bounds respectively. We present these 
bounds using our notation so that (a, (3) is equal to (2m,2tm) and (2tm,2m) in the hrst and second lower bounds in (I 81 
respectively. Note however, that in the above bound the only free parameter is t, i.e., m itself is dependent on t. It is easy to 
see that (3 < K therefore, unlike our method, this method cannot be used to obtain lower bounds when f3 > K. 

The lower bound L$ in eq. (18 1 above is reminiscent of our lower bound if the term Nq is interpreted as a bound on the 
saturation number. In fact, for the specific setting of (a, (3) = ( m,mt ), we can create a problem instance as described below. 
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Fig. 


10: Problem instance associated with the lower bounds in 


U3 


that is a saturated instance with exactly t(m 2 — m + 1) files, so that we can infer that N sat (m,tm, K) < t(m 2 — m + 1). It 
turns out that this upper bound on the saturation number may be slightly stronger than the one we derived in Lemma [2] for 
general a and /3 when t and m are small. The associated problem instance of the first lower bound in (| 1 8| is depicted in Fig. 


10 The corresponding instance for the second lower bound in (|T8) can be derived in a similar manner. In this figure, delivery 


phase signals ID(ui),..., B(u 2 m ) are same as the delivery phase signals defined in |11J. For this tree, it can be verified that the 
instance can be saturated with t(m 2 — m + 1 ) files, so that N sat (m, tm , I\ ) < t(mr — m + 1 ). 

However, an application of Algorith m [3] will result in even better upper bound on the saturation number as shown in the 
example below. In particular. Algorithm [3J will generate a different tree when trying to upper bound the saturation number. 

Example 8: We consider a system with TV = 64 files and K = 8 users and set t = 2 in eq. (18 1 so that m = 4 and N 0 = 26. 
Algorithm [3 for such a setting returns A^ ot (4, 8,8) < 22 which is smaller than TV 0 . This reduction in saturation number is a 
consequence of splitting a and /3 equally in the Algorithm ([3]» and continuing recursively thereafter. On the other hand, it can 
be noted that in Fig. r ^ T ’ 

(cf Claim [3- 


10 node it* is such that it has m = 4 incoming edges which makes the corresponding lower bound looser 


D. Comparison with results in l\29^ 

In ]29) the author provides lower bounds for the specific case of N = K = 3. The inequalities are generated via a computational 
technique that works with the entropic region of the associated random variables. Some of the bounds presented in [|29]| can be 
obtained via our approach as well. However, the specific inequalities 3R*+6M > 8, 18/C + 12M > 29 and 6R*+3M > 8 cannot 
be obtained using our approach and strictly improves our region. Note however, that it is not clear whether these inequalities 
can be obtained in a computationally tractable manner for the case of large TV and K. 


E. Numerical comparison of the various bounds 

We conclude this section, by providing numerical results for two cases: (i) N - 16, K - 30 and (ii) N = 64, K = 50. In 
Fig. 11 the ratio R C {M)/R*(M) is plotted by lower bounding R*(M) by different methods. In case I (see Fig. [TT]) we have 


TV = 16 and I\ = 30. Our bound has the minimum multiplicative gap except in the small range 0 < M < 1. Specifically, as 
discussed previously, the bound in 03 is better than ours when K > TV and a = 1 and 0 < TVf < 1. In case II, where TV > K 
our bound has minimum multiplicative gap for all range of M. 


VII. Conclusions and Future Work 

In this work we have considered a coded caching system with TV files, K users each with a normalized cache of size M. We 
demonstrated an improved lower bound on the coded caching rate R*(M). Our approach proceeds by establishing an equivalence 
between a sequence of information inequalities and a combinatorial labeling problem on a directed tree. Specifically, for given 
positive integers a and /3, we generate an inequality of the form aR* + BM > L. We showed that the best L that can be 
obtained using our approach is closely tied to how efficiently a given number of files can be used by our proposed algorithm. 
Formalizing this notion, we studied certain structural properties of our algorithm that allow us to quantify the improvements that 
our approach affords. In particular, we show a multiplicative gap of four between our lower bound and the achievable rate. An 
interesting feature of our algorithm is that it is applicable for general value of TV, K and M and is strictly better than all prior 
approaches for most parameter ranges. 

There are still gaps between the currently known lower bounds and the achievable rate and an immediate open question is 
whether this gap can be reduced or closed. It would also be of interest to better understand coded caching rates in more general 
scenarios such as the hierarchical coded caching setup and for more general network topologies. 
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R c /R* R c /R * 




Fig. 11: The plot demonstrates the multiplicative gap between the achievable rate, R C (M), in [9] and lower bounds R*(M) using different 
lower bounding techniques. For case II our lower bound results in the least multiplicative gap. In case I, where N < K , the multiplicative gap 
obtained by our proposed lower bound is lower than the others for M > 1. In the range 0 < M < 1, |T0) provides a slightly better result. 
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Appendix 

Lemma 3: Algorithm [TJ always provides a valid lower bound on aR* + (3M where a = l®( w i)| and P = 1 l^(' y *)l- 

Proof: Consider any internal node v £ T. We have 

E H(Z(u)UD(u)\W(u)UW new (u)), 

u£.in(v ) 

( a ) x —. 

> E H(Z{u)UD(u)\W(v)), 

uE.in(v ) 

( 6 ) 

> rf(Z(v)UD(u)|W(w)), 

= I(W new (v)-, Z(v) U B(v)|W(t;)) 

+ H{Z(v) U D(v)|W(v) U W new (v)), 

where inequality in (a) holds since W(u)UW Tletf ,(u) C W(v) and conditioning decreases entropy, (6) holds since U ue i n ^Z(u) = 
Z(v) and U tie j n („)B(u) = B(u) and (c) holds by the definition of mutual information. Let Vi nt denote the set of internal nodes 
in T. Let v* denote the root and (u*,v*) denote its incoming edge. Then, 

E E H(Z(u) U B(u)|W(u) U W new {u)) > 

v£Vint u£in(v ) 

E y(v,out(v))+ E H(Z(v) U B(v)|W(«) U W new {v)), 

VEVint VeVint 

where we have ignored the infinitesimal terms introduced due to Fano’s inequality (for convenience of presentation). Note that 
the RHS of the inequality above contains terms of the form H(Z(v) UlD(i')|W(i;) U W new (v)) for all nodes v £ Vi n t (including 
u*). On the other hand the LHS contains terms of a similar form for all nodes including the leaf nodes but excluding the node 
u*. Canceling the common terms, we obtain, 

i 

E H(Z(vj) UD(t)i)|W nem («i)) > 

i= 1 

E V(v,out{v)) J +H(ZuB(u*)\-W(u*),W new (u*)), 

v&Vi ) 

since W (vf) = (f) for i = 1,..., i. We can therefore conclude that 


y ^ ID(p^)) ^ y ^ y(v,out(v)) 

i—1 v€V 

I I 

^2H(Z(vi)) + ^2H(B(vi)) > E y^autlv)) 


i=1 


i—1 


vev 


Noting that M > H(Z(vi)) and R* > H(D(vi)) we have the required result. 


(19) 


( 20 ) 
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Fig. 12: Tree modification example 


A. Proof of Claim [7] 

Proof: We iteratively modify the problem instance P(T,a, f3, L, N, K) to arrive at an instance where every node has in¬ 
degree at most two. Towards this end, we first identify a node u with in-degree <5 > 3 such that no other node is topologically 
higher than it (such a node may not be unique). 

We modify the instance P by replacing zz with a directed in-tree where each node has in-degree exactly two. Specifically, 


arbitrarily number the nodes in in{u) from v[,...,v'g. We replace the node zz with a directed in-tree T u with leaves v[ 


and root zz. T u has <5 — 2 internal nodes numbered u \, 


f s _ 2 such that in{u'f) = {u' i _ 1 ,v , i+1 } where u' 0 = v[ (see Fig. 


P’S 

12b. Let 


us denote the new instance by P a = P 0 (T 0 , a, (3, L 0 , N, K). We claim that L 0 > L. To see this, suppose that W* £ W^e W {u)- 
We show that W* £ U„/ ^7; W p : w (if ). This ensures that L 0 > L. To see this we note that 


Z p (zz) = Z p °(zz) 

D p (zz) = B p °(zz), and thus, 
A p (zz, zt) = A p °(zz, zt). 


Thus, if W* £ W p ew (u), there exists an internal node v! i £ T u with the smallest index i £ {1,..., <5 — 2} such that W* £ 
A p °(zz', zz'). Note that if i > 1, we have W* £ W p ° w {u'f) since W* ^ A p °(z i' i _ 1 ,u , i _ 1 ) which in turn implies that W* 
W p °(zz'). On the other hand if i = 1, then a similar argument holds since it is easy to see that W* ^ W p °(zz' 1 ). 

Note that the modification in the instance P can only affect nodes that are downstream of zt. Now consider zt' such that 
zz £ in(u'). It is evident that Z p °(zz') = Z p (zz') and B p °(zz') = L> p (zz'). Moreover W p °(zz') = U 1)gin ( u /)W P °(z;) U W p ° w (v). 
Now for v j- zt, W p °(v) = W p (v) and W p ° w {v) = W p ew {v) as there are no changes in the corresponding subtrees. Moreover, 
as A p (zz,zz) = A p °(zz,zz), we have that W p °(zz)U W p ° w (zz) = W p (u)UW p ew (u). This implies that W p °(u') = W p (zt'). Thus, 
we can conclude that W p ° w {u') = W p ew [u!). Applying an inductive argument we can conclude that the W p ° w {n') = W p ew (u') 
for all v! such that zt A v!. 

The above process can iteratively be applied to every node in the instance that is of degree at least three. Thus, we have the 
required result. ■ 


B. Proof of Claim [7] 

Proof: We identify the set U as the set of all nodes in T such that the specified condition in the claim holds. Let U* C lJ 
denote the set of nodes that are highest in the topological ordering . We modify the instance in a way such that a node zt* £ U* 
can be removed from U , i.e., the specified condition no longer holds for it. Moreover, our modification procedure is such that a 
node u >- u* cannot enter U at the end of the procedure. 

We now discuss the modification procedure. In the discussion below, for a given node u, we can consider the instance obtained 
with tree T u . We let j3 u denote the number of cache nodes in this instance. Note that for zz*, the condition j3* < min(/3*. K) 
holds. This implies that there is a set of cache leaves in T u * denoted {z^,... ,z;j m } such that Z(zzj 1 ) = ■ • • = Z(z;j m ) = {Zj}. 
Let A = {zz £ T u * : ( Vi a ,Vi h ) meet at zz, for all distinct z ’i a ,Vi b £ {v^,... We identify zzo £ A such that no element of 

A is topologically higher than uq (note that zzo may not be unique) and let vf and v* be one pair of the corresponding nodes 
in {zzj i; ... ,Vi m } that meet at zzo- W.l.o.g we assume that v* h £ T Uo (r) an d v * a £ T Uo {i)- 

We claim that zzo = zz*. Assume that this is not the case. Since zzo £ T u - we have zzo A u*. Using this and the fact that 
u 0 f: U we have j U„ e c„ 0 Zi(v)\ = min(|C Uo |, K). Now, from v* a ,v* b £ C Uo and that Z(z;*J = Z(z)*J we conclude that 
min(|C Uo |, K) = K. Moreover, as Z(zz) C U u ^j- u ,Z(u) we have (3 = I\ which contradicts $ < min(/3, K). Therefore 

zzo = zz*. 

We construct instance P’ (with lower bound L') as follows. Choose a member of {Z 1 ,... ,Zk} \ {Zj(v') : v' £ C u *} and 
denote it by Z^. We set Z p (v* b ) = {Zkj. Also, for any zz £ T Uo (r) an d D p (zz) = ■Z'd 1 ,...,d K we set D p (zz) = suc h 

that dj = dk and d' k = dj and d! i = di for i ^ {j, k }, i.e., we interchange the j-th and fc-th labels and keep the other labels the 
same. With this modification, it can be seen that f3* = min(/3*,fv). 
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For nodes u >- u*, the change we applied to cache nodes in C u * to get P' is such that f} u continues to equal min(/3„. K) 
since Z k is chosen from {Z i,..., Zk} \ {Z(z/) : v' £ C u *} 

We now show that L' > L. In particular, for u £ T Uo (i ), we have W p ew (u) = W p ew (u), as there are no changes in the 
corresponding labels. Also we claim that W p ew (u) = W p ew (u) for u £ T Uo ( r )- To see this, note that for v £ P Uo (r) and 
v' £ C Uo(r ) we have A p '(v',v) = A p (v',v) if Z(v') g {Zj,Z k }. If Z p ’(v') = {Z k } and D p '(u) = X d ^,^ d > K then, 

A p \v',v) = Rec{{Z k },{X d ^ d , K }) 

= {W d ' k } = {W dj } 

= Rec({Zj}, {X dlr .. tdic }) 

= A p (v', v ). 

Furthermore, note that there does not exist any v’ £ Cu 0 ( r ) such that Z(v') = {Zj } since we picked uq such that no element of 
A is topologically higher than uq. From eq. ({5]) and (jhji, it is not hard to see that this in turn implies that W p ew (u) = W p ew {u ) 
for U £ Tu 0 ( r ) • 

It follows therefore that W p (uq) = W p (uo) (from eq. Let us now consider the other nodes. As the changes are applied 
only to T Ua (r) so label(u) changes only for nodes u such that uq u. Consider the subset of internal nodes U = {wo, wi, • • •, Ut} 
such that (m, «,;+() is an edge, i.e., the set of internal nodes including uq and all nodes downstream of uq such that ut is the last 
internal node. W.l.o.g we assume that 1 £ T Ui m for i > 1. We now show that U u euW p ew (u) C U u euW p ew {u). Towards 
this end we have the following observations for u £ U. 

7L P (u) = Z p (u) U {Z k j (from the construction of P') 

A p (it, it) = U vG v„A p (u,v). 

Now, for v ^ V Uo {r) we have (u) = B p (u) so that 

A p (u,v) = Rec{Z p (u),lD p (u)) 

= Rec(Z p ' {u),D p {v)) 

2 A p (u,v)( since Z p (u) 2 Z p (u)). 

Conversely for v £ T > Uq ( r ) we have 

Rec (iZj,Z k }, B p '(u)) = Rec {{Z 3 ,Z k }, D p (v)), 

and 

f?ec({ZJ,D p '(u)) = Rec({Z t }, B p (u)) (for Z t £ {Z 3 ,Z k }). 

Now, note that {Z k , Zj} C Z p (it) so that 

A p (u,v) = Rec (z p (zt),B p (v)J 
= Rec(z p \u),D p {v) S j , 

2 Rec (Z p (u), D p (u)) = A p (it, v), 
since Z p (it) 2 Z p (u). We can therefore conclude that 

A p (u,u) = U„ e D u A p (u, v) C U„gp ti A p (u,v) = A p (u,u). 

Now we consider a W* £ W p ew (ui) so that W* £ A p (z H,Ui) which by above condition means that W* £ A p (iq,it,;). Thus 
either W* £ W p ew (ui) or W* £ W p (iq). In the latter case there exists a node it,,/ where 0 <i'<i such that W* £ W p ew (ui') 
since W* ^ W(ito) and we have shown that W p (ito) = W p (uo). Thus, we observe that 

L' = I ^ueU W P ew (u)\ + ^2 \Wnew{ u )\i 

ueT',u£U 

> I U„ ei 7 W P ew ('it) I + I W new (u) |, 
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where the second inequality holds since E„ g t' u^u \ W newi u )\ = E ue r u<£u \ W new( u )\ and I u «et/ W£ w (u)\ > \ U uGU 
W p ew (u) |. 

As discussed before, the modification procedure is such that at the end of the operation u* (f U. Moreover nodes u >~ u* are 
not in U either. For each node u £ U let d(u) denote the number of edges in path connecting u to the root node. Our modification 
procedure is such that d* = max u au d{u) is guaranteed to decrease over the course of the iterations. Indeed, if \U*\ = 1, then at 
the end of the iteration d* will definitely decrease. If \U*\ > 1, then d* will definitely decrease after the modification procedure 
is applied to all the nodes in U*. Thus, the sequence of iterations is guaranteed to terminate. This observation concludes the 
proof. 


C. Proof of Lemma [7] 

Proof: 

Given the conditions of the theorem, from Corollary [I] we can conclude that there exists an index i* £ {1,.... a } such 
that E „>ec ip(v.i*,v') < min(/3,A'). We set i* to be the smallest such index. Let II 1 (ui*) = {V £ C : ip(vi*,v') = 1} and 
n 0 (uj*) = {v' £ C : ij)(vi*,v') = 0, Z(u') ^ U„ 6 n Note that 11°(«»*) is non-empty since |U„' G c2i('t' / )| = min(/9, K ) 

and EEeC ' t P{ v i*i v ') < min(/3, K). 

Next, we determine the set of nodes where Vi * and the nodes in 11° (uj*) meet, i.e., we define A°(ui*) = {u £ T : 3v' £ 
n°(ui*) such that Vi * and v' meet at rt.}. Note that there is a topological ordering on the nodes in A°(uj*). Pick the node 
u* £ A °(vi*) such that no element of A °(vi*) is topologically higher than u* (u* is in the path from Vi * to the root node). Let 
the corresponding node in II 0 (uj») be denoted by Vj* where j* £ {a + 1,..., a + /3}. Note that Vj * might not be unique. 

Suppose that Z(u/») = { Zk} and that D(uj») = X We modify the instance P as follows. Set c4 = N + 1 (i.e., the 
index of the N + 1 file). Thus, the only change is in D(uj»). Let us denote the new instance by P' = P(T ’, a, /?, L\ N + 1, K). 

We now analyze the value of L'. W.l.o.g. we assume that and Vj* £ T^.r r y Note that W^ ew {u) = W^ ew {u) for 

u £ 7^. (r ) as the subtree T^,t r \ is identical to T u *(r)- We also have 

W^ewW) = W^ ew {u) for u £ %, {l) . 

To see this suppose that this is not true. This implies that the file Wjv+i is recovered at some node in T' it (l y i.e., there exists 
v' £ C such that v' £ Z(V) = {Zk}, and that v' and Uj* meet at some u >- u*. From Vj- £ n 0 (uj») we can conclude 

that {Zk} ^ and v ' C n°(ui») (as Z(u') = {Zk}). However this is a contradiction, since this implies the existence 

of node u that is topologically higher than u* in the set A°(vi*). It follows from eq. ^ that W p (u*) = W p {u*). 

Next, we claim that W^ ew {u*) = W^ ew (u*) U {Wn+i}. To see this consider the following series of arguments. Let the 
singleton subset A p (vi*, Vj-) = {W*}. Note that fi p (vi -, Vj *) = 0. This implies that there exist v £ V u . and v' £ C u * such 
that v and v' meet above u* and recover the file W* where (v,v') ('Cj-, v 3 *). Thus, as Z p (u*) = Z p (u*), we can conclude 

that 

A p '(u*,u*) = Rec{Z p ' (u*),B p ' (u*)) 

= Aec(Z p (zt*),D p '(u*)) 

= A p (u*,u*)U{W n+1 }. 

Furthermore, we have 

W P ew(u*) = A P '(u*,U*)\W P '(u*) 

= A p (u*,u*) U {Wn+i} \ W p (u*) 

= W p ew {u*) U {W N+ J, (since W N+1 £ W p (u*)). 

For u such that u* >- u we inductively argue that W p ew (u) = W p ew {u). To see this suppose that u* = u r . It is evident that 
A P i(u) = A p (u). Next, A p '(u) = A p (u) since Z k (£ Z(ui) \ Z (u r ). Thus, 

W p ; w (u) = A p / (u) U A p '(n) \ W p '(u) 

= A p (u)UA p (u)\W p '(u) 

= A p (u) U A p (u) \ W p (u) U {W N+1 } 

= A p {u) U A p (u) \ W p (u) (since Wjv+i i X p (u) U A p (u)) 

= W p ew {u). 
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Next, we note that W (u) = W(u r )UW new (u r )UW(ui)UW new (ui). It is evident that W p (■zq) = W p {ui) and W p ew (ui) = 
W p {ui). Next, W p, (u r ) = W p '(u*) = W p (u*) (from above) and W p ' w (u*) = W p ew (u*) U {W N+1 }, so that W p> (u) = 
W p (u) U {W N+ 1 }. 

As the induction hypothesis we assume that for any node u downstream of it*, we have W p ew (u) = W p ev ,(u) and W p (u) = 
W p (u) U {Wn+i}- Consider a node u! such that v! r = u. As before we have W p (u[) = W p (u[), W p ew (u[) = W p ew {u[). 
Moreover, we have W p (u' r ) = W p (z4,)U{kI7v + i} and W p ew (u ' r ) = W p ew (u' r ), by the induction hypothesis, so that W p (vf) = 
W p (u') U {Wn+i}- 

Next, we argue similarly as above that A p (u' ) = A rl( U ') and A Fr («') = A lM’) and the sequence of equations above can 
be used to conclude to that W p ew (u') = W p ew (u'). 

We conclude that V = L + 1. ■ 

D. Proof of Claim [5] 

Proof: 

W.l.o.g we assume that |Fj| > L r for all u £ T. We identify the set U as the set of nodes in T such that I’ T f 1’;. Let 
U* CU denote the set of nodes in U that are highest in the topological ordering. 

Consider a node u* £ U*. Note that since |L;| > |F r |, there exists an injective mapping ft • F r \ T; —► T; \ T r . Let 
Z(u*) = {Zi l: ..., Zi m }. We construct the instance P' as follows. For each v £ T> u * suppose D(u) = {Xd lt ... t d K }- P* 31 
j = 1,..., m, if dij £ T r \T[, we replace it by <^(dj ); otherwise, we leave it unchanged. In other words, we modify the delivery 
phase signals so that the files that are recovered in T u -( r \ are a subset of those recovered in 7),-(; t . 

As our change amounts to a simple relabeling of the sources, for u £ T u *( r ) we have \W p ew (u)\ = \W p ew {u)\. For any 
u ^ it* we have r p (u) C T p [u). Similarly, we can show that T p (u) C T p (u). We note that T p and T p only differ in files 
like Wd where d is in domain of </>(•), i.e., if Wd £ T p then Wm) C T p . If there exist a file Wd £ T p (u) with d in domain of 
</>(■) then W^d) £ F p (u) and from r p (w) C T p (it) we have W^d) G Tf ( u ). Thus, we have T p (it) C T[ (it). This indicates 
that after applying this change, the property of r r C T; still holds in P 1 for all nodes u that are upstream of it*. Furthermore, 
the relabeling of the sources only affects u £ T 7 such that u* y u. Note that W p (u*) C W p (u*) (the inclusion is strict since 
at least one source in r r \ T; is mapped to T; \ r r ) since we have T p C T p and T p = T p . 

Now, we note that 

A ri'(u*) = A p (u*), and 

A P V) = A p (u*), 

where the first equality holds since Z p (u*) = Z p ( u *), Z p (rt*) = Z p (u*) and D p (it^) = D p (u*). The second equality holds 
since our modification to the delivery phase signals in T u *(r) does not affect files that are recovered from Z p (u)') \ Z p (u*). It 
follows therefore that \W p ' w (u*)\ > \W p ew {u*)\. 

We make an inductive argument for nodes u that are downstream of u*; w.l.o.g. we assume that u* £ T u ( r y Specifically, 
our inductive hypothesis is that for a node u that is downstream of u *, we have W p (u) C W p (u), A £ (w) = A p (u) and 
A ir{u) = A f r {u). 

Now consider a node u' downstream of u such that u' r = u. We have, W(w') = W{u[)CW new {u[)CW(u)CW new (u). Note that 
we can express W(u)U W new (u) = W(u)UA r i(u)UA/. r (u) . It is evident that W p (■ u [) = W p (zi() and W p ew (u[) = W p ew (u[). 
Moreover, by the induction hypothesis, W p (u) C W p (u) and A p ’ ( u ) u A p ' (u) = A p («) U A p (u). Thus, the induction step 
is proved. 

We have shown that after applying the changes for u*, the condition L r (7 L; will not hold for u f u*. For each node 
u £ U let d(u) denote the number of edges in path connecting u to the root node. Our modification procedure is such that 
d* = max u6 u d(u) is guaranteed to decrease over the course of the iterations. Indeed, if \U*\ = 1, then at the end of the iteration 
d* will definitely decrease. If \U*\ > 1, then d" will definitely decrease after the modification procedure is applied to all the 
nodes in LA*. Thus, the sequence of iterations is guaranteed to terminate. This observation concludes the proof. ■ 

Claim 8: Under condition of fti = min (fti,K) and ft r = min(/3 r , K) we have min (fti,K — ft r ) = [min {fti,K — ft r )] + and 
min(/3 r , I< - fti) = [min(/3 r , K - fti)} + . 

Proof: First, we consider the case where fti + ft r < K so fti < K — ft r and [min (fti,K — ft r )] + = fti- By assumption, 
fti + ftr < K implies fti + ft r < K thus min(/3j, K — ft r ) = fti = fti. We now consider the fti + ft r > K case which in turns 
leads to fti + ft r > K. Therefore, 

mm(fti, K — ft r ) = K — ft r = K — min(A', ft r ) = max(0, K — ft r ) = [K — ft r ] + = [min (fti,K — ft r )] + ■ 

The same argument will show that min(/3 r , K — fti) = [min(/3 r , K — fti)] + . ■ 
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Claim 9: Consider the integers a, ai,a r , /3, (3i,/3 r , K so that a = ai + a r and /3 = /3j + /3 r . Then 

a min(/3, K ) = ai min(/3j, K) + a r min(/3 r , K) + aj[min(/3 r , K — (3i)] + + a r [min(/3;, K — /3 r )] + . 

Proof: First, we consider the case where 6 < K thus ff < K — f3 r and li r < K If . Then, the above relation reduces to 
a/3 = ai(3i+a r f3 r +ai/3 r +a r l3i which is true. For the case /3 > K, the relation reduces to aK = ai (min(/3;. K) + [K — /3;] + ) + 
a r (min(/3 r , I \) + [K — /3 r ] + ). This equation holds since min(/3z, K) = K — [. K — /3;] + , and same thing for j3 r - ■ 



